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The research leading to the present invention was supported, at least in part, by a grant 
from the National Institutes of Health, Grant No. GM59432-01. Accordingly, the 
1 5 Government may have certain rights in the invention. 



The present invention provides the three-dimensional structure of a complex between 
20 the phosphotyrosine binding domain (PTB) of Sue 1 -associated neurotrophic factor 
target protein ( SNT) and the SNT binding site of the fibroblast growth factor receptor 
(FGFR). The three-dimensional structural information is included in the invention. 
The interaction between SNT and FGFR plays a key role in the regulation of cell 
proliferation. Therefore, the present invention provides procedures for identifying 
25 agents that can inhibit tumor proliferation through the use of rational drug design 
predicated on the three-dimensional data provided herein. 



30 The discovery and the extensive biochemical and biophysical characterization of the 
evolutionarily conserved cytoplasmic protein modular domains (such as SH2, SH3, 
PH, and WW domains) have revolutionized the way protein-protein interactions in 
cellular signal transduction is understood [Pawson and Scott, Nature 278: 2075-2080 
(1997)]. Since these protein module-mediated interactions play an important role in 

35 regulating numerous cellular processes including cell growth, proliferation, 
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differentiation and apoptosis, studies of the protein domain-mediated pathways have 
revealed the molecular basis of various human diseases. This has led to the discovery 
of new drug targets for treatment of these diseases. 

5 The Phosphorylated Tyrosine Binding (PTB) domain (also called PID or SAIN 
domains) is another evolutionarily conserved cytoplasmic protein modular domain 
involved in cellular signal transduction. The PTB domain was first identified as a 
protein module that could bind phosphorylated tyrosines, and thus was classified as an 
alternative to the Src homology 2 (SH2) domain [Blaikie et al J. Biol. Chem. 

10 269:32031-32034 (1994); Kavanaugh, and Williams, Science 266:862-1865 (1994); 
O'Neill et al, Mol Cell Biol 14: 6433-6442 (1994)]. However, PTB domains are 
structurally and functionally distinct from SH2 domains, and recognize amino acid 
residues amino-terminal (rather than carboxyl-terminal) to the phosphotyrosine (pY) 
[Zhou and Fesik, S. W. Prog. Biophys. Molec. Biol 64:221-235 (1995)]. In particular, 

1 5 PTB domains preferentially bind to phosphorylated proteins at sites containing an 
NPXp Y motif and hydrophobic amino acids amino-terminal to this sequence 
[Gustafson et al Mol Cell Biol 15:2500-2508 (1995); Kavanaugh et al Science 
268:1177-1179 (1995); Zhou etal 9 J. Biol Chem. 270:31119-31123 (1995); Triib et 
al, J. Biol Chem. 270:18205-18208 (1995); Singyang, J. Biol Chem. 270:14863- 

20 14866 (1995)]. Moreover, unlike SH2 domains, PTB domains typically show very 
low protein sequence homology. Different PTB domains exhibit distinct selectivity 
for residues amino-terminal to the NPXpY motif (SEQ ID NO:6). For example, the 
PTB domain of the insulin receptor substrate 1 (IRS-1) favors hydrophobic residues at 
different structural locations of its receptor, than those preferred by the PTB domain 

25 of the adaptor protein She. Recent studies demonstrate that the PTB domains of XI 1 
and Numb can also recognize sequences related to the NPXpY motif in tyrosine- 
phosphorylation and non-phosphorylation-dependent manners [Li et al Proc. Natl 
Acad. ScL USA 94: 7204-7209 (1997); Li et al Nat., Struct. Biol 5:1075-1083 
(1998); and Zhang et al, EMBO J. 16:6141-6150 (1997)]. 



The NMR structural analyses of the She and IRS-1 PTB domains revealed the 
detailed structural basis of their protein recognition [Zhou et al, Nature 378:584-592 
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(1995); and Zhou et al, Nature Struct. Biol. 3:388-393 (1996)]. Despite their very 
low sequence homology, the PTB domains of She and IRS-1 consist of a conserved 
pleckstrin (PH) domain fold, i.e. a P-sandwich containing two nearly orthogonal, 
anti-parallel P-sheets capped at one end by an amphipathic C-terminal a-helix. 
5 Furthermore as stated above, the structurally related PTB domains of She and IRS-1 
employ two very different mechanisms for recognizing the phospho tyro sine and the 
hydrophobic residues amino-terminal to the NPXpY sequence. For example, for She, 
an He residue of a synthetic peptide derived from TRKA receptor 
(HIIENPQpYFSDA, SEQ ID NO:7) binds in a deep hydrophobic pocket located 
10 between p5 and the C-terminal a-helix. The corresponding residue of a peptide 
derived from interleukin-4 receptor (IL-4R) (LVIAGNPApYRS, SEQ ID NO:4), 
binds on the surface of the protein. In addition, the IRS-1 PTB domain recognizes lie 
and Leu of the IL-4R peptide through interactions with a hydrophobic site on the 
surface of a second 

15 P-sheet, whereas the analogous site in She is not available for peptide binding, 

because it is covered by a loop and the N-terminal portion of an a-helix. Notably, in 
contrast to SH2 domains, key arginines that are important for binding to 
phosphotyrosine are located in different regions of different PTB domain sequences. 

20 SNT (sue 1 -associated neurotrophic factor target protein) proteins are two newly 

discovered insulin receptor substrate-like (IRS-like) signaling adaptor molecules that 
are specifically activated by receptors for fibroblast growth factors (FGF), nerve 
growth factors (NGF), and glial-derived neurotrophic factor, but are not activated by 
most other growth factor receptor kinases [Xu et al, J. Biol. Chem. 273:17987-17990 

25 (1998); Kouhara et al Cell 89:693-702 (1997); Meakin et al, J. Biol Chem. 
274:9861-9870 (1999)]. SNT tyrosine phosphorylation promotes activation of 
Ras/MAPK and SHP-2, two biochemical pathways critical for ligand-induced 
biological responses. Current studies suggest that SNT activation enables FGFs and 
NGFs to elicit specific biological responses not achieved by activation of other 

30 receptor tyrosine kinases, which fail to stimulate SNTs. It has further been shown that 
the activation and tyrosine-phosphorylation of SNTs require direct contact between 
receptors and the amino-terminal phosphotyrosine binding domain of each SNT [Xu 
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etal.,J. Biol Chem. 273:17987-17990 (1998); Meakin etal.,J. Biol Chem. 274:9861- 
9870 (1999)]. Whereas SNT PTB domains recognize a canonical NPXpY motif on 
NGF receptors such as TRKA, they unexpectedly bind a tyrosine- and asp aragine- free 
motif in the juxtamembrane segment of the FGF receptor (FGFR). Thus, the SNT 
5 PTB domain represents a very unique protein modular domain, which can specifically 
bind two seemingly unrelated receptor peptide moieties. 

The structural and functional diversity of PTB domains is further demonstrated in the 
SNT PTB domains. Sequence homology alignment and secondary prediction analysis 

10 using an approach of profile-based neural network predictions [PHD method; EMBL- 
Heidelberg et al, Mol Bol 232, 584-599 (1993); Rost and Sander, Proteins 19:55-77 
(1994)] reveals that SNT PTB domains contain a large insert sequence (predicted to 
be an a-helix) located between the corresponding strand [37 and the C-terminal a-helix 
in the She and the IRS-1 PTB domains. It is interesting to note that whereas in all of 

15 the published three-dimensional structures of PTB and PH domains (regardless of 

whether they were determined by NMR or X-ray crystallography) structural variations 
have been found in different loop regions, none of have been observed between the p7 
and the C-terminal a-helix region. 

20 One means of modulating cellular proliferation and/or differentiation is to either 
inhibit or facilitate the interaction of the PTB domain of an SNT and the FGF 
receptor. Therefore, there is a need to identify agonists or antagonists to the 
SNT/FGFR complex. Unfortunately, such identification has heretofore relied on 
serendipity and/or systematic screening of large numbers of natural and synthetic 

25 compounds. A far superior method of drug-screening relies on structure based drug 
design. In this case, the three dimensional structure of SNT/FGFR complex is 
determined and potential agonists and/or potential antagonists are designed with the 
aid of computer modeling [Bugg et al, Scientific American, Dec.:92-98 (1993); West 
et al, TIPS, 16:67-74 (1995); and Dunbrack et al, Folding & Design, 2:27-42 

30 (1997)]. However, heretofore the three-dimensional structure of the SNT/FGFR 
complex has remained unknown. Therefore, there is a need for obtaining a form of 
the SNT/FGFR complex that is amenable for NMR analysis and/or X-ray 
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crystallographic analysis. Furthermore there is a need for the determination of the 
three-dimensional structure of such complexes. Finally, there is a need for procedures 
for related structural based drug design predicated on such structural data. 

5 The citation of any reference herein should not be construed as an admission that such 
reference is available as "Prior Art" to the instant application. 

SUMMARY OF THE INVENTION 

10 The present invention provides the structure of a binding complex between the PTB 
domain of SNT-1 and a peptide derived from the SNT-1 binding site of the fibroblast 
growth factor receptor (FGFR). The structural information provided can be 
employed in methods of identifying drugs that can modulate cellular proliferation 
since the SNT-1/ FGFR interaction plays a major role in the regulation of mitogenic 

1 5 cell signaling. In a particular embodiment, the three-dimensional structural 
information is used in the design of an inhibitor of cell proliferation. Such an 
inhibitor could be used as an anti-tumor agent. 

The present invention also provides an isolated nucleic acid that encodes a 
20 polypeptide that comprises the amino acid residues 1 1- 140 of SEQ ID NO: 1 . In a 
related embodiment the peptide comprises amino acid residues 1 1-140 of SEQ ID 
NO:l with a conservative amino acid substitution. 

The present invention also includes an isolated nucleic acid that encodes a peptide 
25 derived from FGFR1 consisting of 12 to 100 amino acids, preferablyl6 to 50 amino 
acids and more preferably 20 to 30 amino acids which comprises the amino acid 
sequence of SEQ ID NO:5: 

Val Xaa Xaa Leu Xaa Xaa Xaa He Xaa Leu Xaa Arg Xaa Val Xaa Val. 
Preferably this peptide binds to the PTB domain of SNT1. 

30 

In another embodiment, the present invention provides an isolated nucleic acid that 
encodes a peptide derived from FGFR1 consisting of 12 to 100 amino acids, 



preferably 16 to 50 amino acids and more preferably 20 to 30 amino acids which 
comprises the amino acid sequence of SEQ ID NO:3. In a related embodiment the 
nucleic acid encodes a peptide that comprises the amino acid sequence of SEQ ID 
NO: 3 with a conservative amino acid substitution. Again it is preferable that the 
5 peptide binds to the PTB domain of SNT1 . 

All of the nucleic acids of the present invention can be in an isolated form, and/or 
operatively linked to an expression control sequence. In addition, all of the nucleic 
acids of the present invention can further comprise a heterologous nucleotide 

10 sequence. Furthermore, all of the nucleic acids of the present invention operatively 
linked to an expression control sequence can be used to transform or transfect a 
unicellular host. The present invention also provides methods of expressing the 
peptides and proteins fragments of the present invention in the unicellular host. One 
such method comprises culturing the unicellular host in an appropriate cell culture 

1 5 medium under conditions that provide for the expression of the fragment and/or 
peptide by the cell. In addition, the present invention includes methods that further 
comprise the step of purifying the peptides and fragments. The purified forms of the 
fragments and peptides are also included as part of the present invention. 



20 The present invention also provides an isolated polypeptide comprising amino acid 
residues 1 1 -140 of SEQ ID NO: 1 . In a related embodiment the peptide comprises 
amino acid residues 1 1-140 of SEQ ID NO:l with a conservative amino acid 
substitution. 

25 The present invention also includes an isolated peptide derived from FGFR1 
consisting of 12 to 100 amino acids, preferably 16 to 50 amino acids and more 
preferably 20 to 30 amino acids which comprises the amino acid sequence of SEQ ID 
NO:5: 

Val Xaa Xaa Leu Xaa Xaa Xaa He Xaa Leu Xaa Arg Xaa Val Xaa Val. 
30 Preferably this peptide binds to the PTB domain of SNT1 . 
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In another embodiment, the present invention includes an isolated peptide derived 
from FGFR1 consisting of 12 to 100 amino acids, preferablyl6 to 50 amino acids and 
more preferably 20 to 30 amino acids which comprises the amino acid sequence of 
SEQ ID NO:3. In a related embodiment the peptide comprises the amino acid 
5 sequence of SEQ ED NO:3 with a conservative amino acid substitution. Again it is 
preferable that the peptide binds to the PTB domain of SNT1 . In a particular 
embodiment, the peptide has the amino acid sequence of SEQ ID NO:4. 

The present invention also provides fusion proteins/peptides including chimeric 
10 proteins/peptides comprising the peptides and/or fragments of the present invention. 
All of the isolated peptides and/or fragments of the present invention, and all of the 
recombinant peptides and/or fragments of the present invention may be parts of these 
fusion arid chimeric proteins/peptides. In one such embodiment, a fusion protein 
comprises the PTB domain of SNT-1 and the green fluorescent protein. In another 
1 5 such embodiment, the fusion protein comprises a FGFR derived peptide together with 
a FLAG tag. The present invention further provides methods of using the peptides 
and/or fragments of the present invention in a drug screening assay. Any of the 
peptides and/or fragments, and/or fusion proteins/peptides of the present invention 
may be used in such methods. The peptides or protein fragments of the present 
20 invention can also be labeled. In another such embodiment, the peptides and/or 
fragments can be bound to a solid support. 

The present invention further provides assays for testing potential drugs that are 
selected/identified by the three-dimensional structural analysis of SNT/FGFR 
25 complex of the present invention, for the ability of the potential drug to interfere with 
the complex formation, for example. 

The present invention further provides methods of identifying compounds or agents 
that modulate the stability of a SNT/FGFR complex and/or modulate the binding of 
30 SNT with FGFR. In one aspect of the present invention these methods employ the 
three-dimensional structure of the SNT/FGFR complex. 
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One such embodiment comprises selecting a potential compound by performing 
rational drug design with the set of atomic coordinates obtained from Tables 1-5. In 
a related embodiment the selection of the potential compound is performed by rational 
drug design after obtaining a set of atomic coordinates defining the three-dimensional 
5 structure of the SNT/FGFR complex consisting of a fragment of SNT consisting of 
amino acid residues 1 1-140 of SEQ ID NO: 1 and a fragment of FGFR consisting of 
SEQ ID NO:3. Preferably the selecting is performed in conjunction with computer 
modeling. In either case, the potential compound is contacted with a SNT/FGFR 
complex comprising an SNT or an SNT fragment, and FGFR or an FGFR fragment, 

10 and the stability of the SNT/FGFR complex is determined (e.g., measured) in the 
presence of the compound. A potential compound is identified as a compound that 
modulates the stability of the SNT/FGFR complex when there is an change in the 
stability of the SNT/FGFR complex. The compound is identified as a stabilizer of the 
SNT/FGFR complex when the stability of the SNT/FGFR complex increases in its 

1 5 presence, whereas the compound is identified as a destabilizer of the SNT/FGFR 
complex when the stability of the SNT/FGFR complex decreases in its presence. 

In another embodiment, the method identifies a compound or an agent that modulates 
the formation of a SNT/FGFR complex using the three-dimensional structure of the 

20 SNT/FGFR complex. One particular embodiment of this type comprises selecting a 
potential compound that binds to the PTB domain of SNT. In another embodiment, a 
potential compound is selected that binds to the SNT binding region of FGFR. 
Preferably the selection is performed using rational drug design with the set of atomic 
coordinates obtained from Tables 1-5. Alternatively, the selection of the potential 

25 compound is performed by rational drug design after obtaining a set of atomic 
coordinates defining the three-dimensional structure of the SNT/FGFR complex 
consisting of a fragment of SNT consisting of amino acid residues 11-140 of SEQ ID 
NO:l and a fragment of FGFR consisting of SEQ ID NO: 3. Preferably the selection 
is performed in conjunction with computer modeling. The potential compound is 

30 contacted with an SNT or an SNT fragment, and FGFR or an FGFR fragment under 
conditions in which the SNT/FGFR complex can form in the absence of the potential 
compound. The binding affinity of the SNT or the SNT fragment with FGFR or the 
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FGFR fragment is then determined (e.g., measured) in the presence of the compound. 
A potential compound is identified as a compound that modulates the formation of the 
SNT/FGFR complex when there is a change in the binding affinity of the SNT or the 
SNT fragment with FGFR or the FGFR fragment. The compound is identified as an 
5 agonist of the formation of the SNT/FGFR complex when the binding affinity of the 
SNT/FGFR complex increases in its presence, whereas the compound is identified as 
an inhibitor of the formation of the SNT/FGFR complex when the binding affinity of 
the SNT/FGFR complex decreases in its presence. 

10 The present invention further provides a method of selecting a compound or an agent 
that potentially modulates the SNT/FGFR dependent cellular signaling pathway. One 
such embodiment comprises defining the structure of the SNT/FGFR complex by the 
atomic coordinates obtained from Tables 1-5 and selecting a compound which 
potentially modulates the SNT/FGFR dependent cellular signaling pathway with the 

15 aid of the defined structure. In a related embodiment the selection of the potential 
compound is performed by rational drug design after obtaining a set of atomic 
coordinates defining the three-dimensional structure of the SNT/FGFR complex 
consisting of a fragment of SNT consisting of amino acid residues 1 1-140 of SEQ ID 
NO:l and a fragment of FGFR consisting of SEQ ID NO:3. In one such embodiment 

20 the compound inhibits the SNT/FGFR dependent cellular signaling pathway, whereas 
in another embodiment the compound stimulates the SNT/FGFR dependent cellular 
signaling pathway. 

The present invention further provides a method of selecting a compound that 
25 potentially binds to the PTB domain of SNT1 that comprises defining the structure of 
the SNT/FGFR complex by the atomic coordinates obtained from Tables 1-5. With 
the aid of the defined structure a compound is selected which potentially binds the 
PTB domain of SNT1 . In a related embodiment the selection of the potential 
compound is performed by rational drug design after obtaining a set of atomic 
30 coordinates defining the three-dimensional structure of the SNT/FGFR complex 

consisting of a fragment of SNT consisting of amino acid residues 1 1-140 of SEQ ID 
NO:l and a fragment of FGFR consisting of SEQ ID NO:3. In a related embodiment, 
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a compound is selected that binds to the SNT binding region of FGFR. In another 
embodiment, the compound is selected to bind to the SNT/FGFR complex. 

As anyone having skill in the art of drug development would readily understand, the 
5 potential drugs selected by the above methodologies can be refined by re-testing in 
appropriate drug assays, including those disclosed herein. Chemical analogs of such 
potential drugs can be obtained (either through chemical synthesis or drug libraries) 
and be analogously tested. Therefore, methods comprising successive iterations of the 
steps of the individual drug assays, as exemplified herein, using either repetitive or 
10 different binding studies, or transcription activation studies or other such studies are 
envisioned in the present invention. In addition, potential drugs may be identified 
first by rapid throughput drug screening, as described below, prior to performing 
computer modeling on a potential drug using the three-dimensional structure of the 
SNT/FGFR complex. 

15 

The present invention further comprises all of the potential, selected, and putative 
drugs as well as the drugs themselves identified by methods of the present invention. 

Accordingly, it is a principal object of the present invention to provide the three- 
20 dimensional coordinates of the SNT/FGFR complex. 

It is a further object of the present invention to provide soluble fragments of SNT that 
comprise the PTB domain, and that can bind to FGFR. 

25 It is a further object of the present invention to provide a soluble fragment of FGFR 
that comprises the binding site for the PTB of SNT. 

It is a further object of the present invention to provide methods of identifying drugs 
that can modulate the SNT/FGFR interaction. 



30 
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It is a further object of the present invention to provide methods of identifying drugs 
that can modulate cellular proliferation through modulating the SNT/FGFR 
interaction. 

5 It is a further object of the present invention to provide methods that incorporate the 
use of rational design for identifying drugs that stabilize the SNT/FGFR complex. 

It is a further object of the present invention to provide methods that incorporate the 
use of rational design for identifying drugs that inhibit the SNT/FGFR complex. 

10 

It is a further object of the present invention to provide detailed structural information 
regarding the binding of SNT and FGFR electronically, magnetically, or 
electromagnetically. 

15 It is a further object of the present invention to provide a computer that comprises a 
representation of the three-dimensional structure of the SNT/FGFR binding complex 
and/or relevant portions thereof. 

It is a further object of the present invention to provide methodologies for exploiting 
20 such structural information in order to develop potential anti-tumor drugs with the use 
of structure based rational drug design. 

These and other aspects of the present invention will be better appreciated by 
reference to the following drawings and Detailed Description. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-1B depict an alignment of protein sequence homology. Figure 1 A depicts 
the Sequence alignment of PTB domains of the SNT and IRS proteins. Amino acid 
30 sequence and accession numbers of the proteins are indicated along the protein 

sequences. Protein sequences of FRS2a and FRS2(i have been reported [Ong et ai, 
Mol.Cell.BioL 20:979-989 (2000)]. The experimentally determined secondary 
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structure elements are displayed above or below the sequences of the PTB domains of 
SNTs or IRSs [Zhou et al. 9 Nat. Struc, Biol 3:388-393 (1996)], respectively. 
Asterisks highlight residues in the SNT-1 PTB domain that show intermolecular 
NOEs to the hFGFRl peptide. Absolutely or highly conserved residues among the 
5 SNT and IRS PTB domains are shown in red and blue, respectively. Two underlined 
Arg residues of SNT-1 were both changed by site-directed mutagenesis to Gin. 
Arrows indicate constructs used in truncation analysis of SNT- 1 PTB domain binding 
to hFGFRl or TRK. Pro residues located C-terminal to the SNT-1 PTB domain are 
shown in bold. Figure IB depicts the Sequence alignment of the juxtamembrane 

10 region of the FGFR family. For each FGFR group (FGFR1-4), protein sequences 

from three representative species, i.e., human, mouse, and xenopus, are selected. The 
number of observed intermolecular NOEs identified for a particular amino acid 
residue of the hFGFRl peptide is shown in red above the sequence. Absolutely or 
highly conserved residues are highlighted in yellow and blue background, 

1 5 respectively. 

Figures 2A-2D depict the structure of the SNT-1 PTB domain/hPGFRl complex- 
Figure 2 A shows the stereo view of the backbone atom superposition of the final 20 
NMR-derived structures of the complex, containing the SNT-1 PTB domain residues 

20 18-116 and the hFGFRl peptide residues 41 1-430. The terminal residues, which are 
structurally disordered, are omitted for clarity. For the final 20 structures, the 
root-mean-square deviations (RMSDs) of the backbone and all heavy atoms for 
protein residues 18-116 are 0.74 ± 0.16A and 1.46 ± 0.16A, respectively. The 
corresponding RMSDs for the protein secondary structure regions (protein residues 

25 19-24, 35-40, 45-49, 52-57, 63-68, 71-76, 85-90, 94-107 and 111-115) are 0.40 ± 
0.05 A and 0.88 ± 0.05 A, respectively. The RMSDs of the backbone and all heavy 
atoms for the hFGFRl peptide (residues 412-430) are 0.56 ± O.lOA and 1.25 ± 0.15A, 
respectively. Figure 2B is a ribbon depiction of the averaged minimized NMR 
structure of the SNT-1 PTB domain/hFGFRl complex. The orientation of Figure 2B 

30 is as shown in Figure 2A. Figure 2C is a ribbon diagram of the SNT-1 PTB domain 
structure from the top of the protein, which is rotated ~90 ° from the orientation in 
Figure 2B. Figure 2D is a molecular surface representation of the SNT-1 PTB 



13 

domain structure calculated in GRASP [Nicholls et ai, Biophys. J. 64:166-170 
(1993)]. The protein is color-coded by surface curvature, and the color gradient from 
green to dark gray reflects decreasing solvent exposure. The hFGFRl peptide 
molecule is shown as a ball and stick representation color-coded by atom-type. 

5 

Figure 3A depicts the 2D 'H/^N-HSQC spectrum of the SNT-1 PTB domain (0.25 
mM) complexed to a FGFR peptide (1:1). The spectrum was recorded on a Bruker 
DRX 500 MHz NMR spectrometer for the protein in 50 mM phosphate buffer of pH 
6.5 containing 5 mM DTT and 0.5 mM EDTA at 25°C. Figure 3B depicts the effects 
10 of a FGFR peptide or a TRKA phosphopeptide binding on the backbone amide 
signals in the HSQC spectrum of the SNT-1 PTB domain. The amide peaks of the 
protein in the free, the FGFR peptide-, or the TRKA phosphopeptide-complexed 
forms are color-coded in black, red, and blue, respectively. Green arrows indicate the 
shifts of the amide signals upon binding to the corresponding peptides. 

15 

Figures 4A-4E depict the intermolecular interactions in the SNT-1 PTB 
domain/hFGFRl complex. Figure 4 depicts the secondary structure of the 
intermolecular antiparallel P-sheet of the complex. The number of intermolecular 
NOEs observed in I3 C- or i5 N-edited (F y ), i3 C/ I5 N-filtered (F 5 ) 3D NOESY spectra is 

20 summarized for individual amino acid residue. NOEs that define the structure of the 
p-sheet are indicated by arrows. Arrows for intramolecular, 13 C-based and 15 N-based 
intermolecular NOEs, are color-coded in black, green and red, respectively. Broken 
lines (blue) highlight two intermolecular hydrogen bonds that are supported by amide 
exchange data. The intermolecular interactions are depicted for three-regions of the 

25 hFGFRl peptide (green). Figure 4B depicts the C-terminal (residues 424-430) region. 
The side-chains of the protein and the peptide residues are displayed in orange and 
blue, respectively. Figure 4C shows the middle (residues 417-423) region. The three 
loops connecting pi/p2, p3/p4 and p6/p7 that form a hydrophobic binding pocket for 
binding to the peptide residue Val-414 are colored in pink. Figure 4D shows the 

30 N-terminal (residues 409-416) region. Figure 4E depicts the complementary 
electrostatic interactions between the SNT-1 PTB domain and hFGFRl peptide 
residues. 
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Figure 5 depicts a schematic of a computer comprising a central processing unit 
("CPU"), a working memory, a mass storage memory, a display terminal, and a 
keyboard that are interconnected by a conventional bidirectional system bus. The 
computer can be used to display and manipulate the structural data of the present 
5 invention. 

Figures 6A-6D show the mutational analysis of the SNT-1 PTB domain interactions 
with FGFRs or TRKs. Figure 6 A shows the effect of hFGFRl point mutations on 
interactions with the SNT-1 PTB domain, determined by yeast two-hybrid binding 

10 assays. Data for peptide mutants are calculated from an average of 5 independent 
experiments. Western blot showing BD fusion protein expression of wild type and 
mutant hFGFRl in the yeast cells. Figure 6B shows the structure of the SNT-1 PTB 
domain/hFGFRl complex showing location of Arg-63 and Arg-78 (blue) that are 
essential for binding to the phosphotyrosine in the NPXpY motif. The backbone of 

15 the hFGFRl peptide is shown in green. The distinct p8 strand of the SNT-1 PTB 

domain is displayed in red. Figure 6C depicts the structure of the IRS-1 PTB domain 
in complex with a tyrosine-phosphorylated peptide derived from interleukin-4 
receptor (LVIAGNPApYRS, residues 489-499) determined by NMR [Zhou et al 9 
Nat. Struc, Biol. 3:388-393 (1996)]. The peptide residues are shown in green, and the 

20 two key Arg residues (Arg-212 and Arg-227) of the PTB domain that are essential for 
phosphotyrosine binding are displayed in blue. Figure 6D shows the results of a yeast 
two-hybrid binding studies of SNT-1 p8 truncation effect on its interactions with 
hFGFRl and tyrosine-phosphorylated TRKB. The panel framed in red shows the loss 
of interaction between hFGFRl and the SNT-1 PTB domain protein lacking the p8 

25 strand. Colony formation on the synthetic complete medium lacking Leu and Trp 
(Leu\ Trp") illustrates the efficiency of co-transformation with the two plasmids, 
while growth on the corresponding medium lacking His, Leu and Trp (His", Leu", 
Trp") shows level of protein-protein interaction. 



30 
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DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides the first detailed structural information regarding the 
complex of the PTB domain of SNT-1 with a peptide derived from the known SNT-1 
5 binding site of the fibroblast growth factor receptor (FGFR). The present invention 
therefore provides the three-dimensional structure of the SNT-1 PTB domain/FGFR 
complex. Since the SNT-1 /FGFR interaction plays a significant role in the regulation 
of mitogenic cell signaling, the structural information provided herein can be 
employed in methods of identifying drugs that can modulate the proliferation of cells. 
10 In a particular embodiment, the three-dimensional structural information is used in the 
design of an inhibitor of cell proliferation for the treatment of cancer. 

The three-dimensional structure of the SNT-1 PTB domain/FGFR complex disclosed 
herein demonstrates that the interaction between the PTB domain of SNT-1 and 

15 FGFR differs significantly from the manner that PTB domains have been reported to 
interact with the canonical NPXpY motif (where X is any amino acid and pY is a 
phosphorylated tyrosine). Furthermore, the structural information disclosed herein 
provides the basis for mutational analysis of the SNT-1 PTB/FGFR complex. Key 
amino acid residues from both SNT-1 and FGFR that are essential for the protein- 

20 protein recognition are also identified. 

SNT-1 also plays a role in regulating nerve growth factor (NGF) receptor signaling 
via its binding through its PTB domain with the activated/tyrosine phosphorylated 
NGF receptor. As stated above this interaction differs significantly with that 

25 disclosed herein for the SNT/ FGFR complex. Based on the new structural 

information, the key amino acid residues for the binding of the SNT 1 -PTB domain 
with the NGF TRKA receptor can be identified and further elucidated using basic 
mutagenesis and standard isothermal titration calorimetry. The results obtained from 
the structural and functional studies disclosed herein indicate that there is a novel 

30 mechanism through which SNT proteins regulate neuronal cell growth and 
differentiation. Such detailed structural and mechanistic findings provide the 
foundation for structure-based rational drug design. The agents identified by this 
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procedure will be useful for ameliorating conditions involving the dysfunction of cell 
signaling pathways involving the NGF receptor and the FGF receptor. More 
particularly, FGFRs have been implicated to play a causal role in the development of 
the following human ailments such as in tumorigenesis including various forms of 
5 human cancer, and in skeletal disorders, such as Achondroplasia, hypochondroplasia, 
Crouzon syndrome, Apert syndrome, and Pfeiffer syndrome. 

Structure based rational drug design is the most efficient method of drug 
development. However, heretofore, little information has been disclosed regarding 

10 the structure of the SNT/FGFR interaction. Obtaining detailed structural information 
requires an extensive NMR or X-ray crystallographic analysis. In the former case, the 
entire SNT-1 /FGFR complex has a molecular weight which is beyond the present 
capabilities of NMR analysis. In the latter case, crystallography of proteins remains a 
black art. The present invention overcomes the difficulties described above, by 

15 providing fragments of the SNT-1/FGFR complex (identified below) that retain the 
necessary structural elements for binding. Such SNT-1/FGFR complexes are 
amenable to NMR structural analysis. By determining and then exploiting the 
detailed structural information of this complex (exemplified by NMR analysis below) 
the present invention provides novel methods for developing new anti-tumor drugs 

20 through structure based rational drug design. 

Thus the present invention provides a representative set of structural coordinates for 
the SNT-1 PTB domain/FGFR peptide complex (Table 1) which were obtained by 
NMR analysis. A Ribbon diagrams of the three-dimensional structure of the SNT-1 

25 PTB domain in complex with the FGFR peptide is shown in Figures 2 and 4. The 
present invention also provides the NOE-derived distance restraints, and NMR 
chemical shift assignments of the SNT-1 PTB domain/FGFR peptide. The NMR 
chemical shift assignments of the SNT-1 PTB domain/FGFR peptide complex are 
included in the chemical shift table (Table 2) for the l H- 15 N HSQC spectrum of SNT- 

30 1 PTB domain/FGFR peptide complex. Tables 3-5 contain the NMR experimentally 
determined distance restraints that are used in the structural calculations, including the 
hydrogen bond distance restraints (Table 3), the unambiguous distance restraints 
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(Table 4) and the ambiguous distance restraints (Table 5). The sample coordinates 
data set of Table 1, the chemical shifts of Table 2, along with the information 
contained in Tables 3-5 are sufficient to enable the skilled artisan to practice the 
invention. In addition, Tables 1-5 are also capable of being placed into a computer 
5 readable form which is also part of the present invention. Furthermore, methods of 
using these coordinates and chemical shifts and related information (including in 
computer readable forms) in drug assays are disclosed. More particularly, such 
coordinates can be used to identify potential ligands or drugs which will modulate the 
binding of the SNT with FGFR. Since the SNT/FGFR interaction is naturally 
10 stimulated by FGF and leads to the activation of cell proliferation agents such as Ras, 
compounds that inhibit the SNT FGFR binding will act to inhibit cellular proliferation 
at a comparatively early stage of the signal transduction pathway. Similarly, 
compounds that act as agonists for the SNT FGFR interaction will act as agonists for 
cellular proliferation. 

15 

In addition, the present invention provides a computer that comprises a representation 
of the SNT/FGFR binding complex in computer memory that can be used to screen 
for compounds that will enhance or alternatively inhibit the binding of SNT to FGFR. 
In a related embodiment, the computer can be used in the design of altered SNT 

20 and/or FGFR proteins that have either enhanced, or alternatively diminished binding 
affinity for each other. Preferably, the computer comprises portions or all of the 
information contained in Tables 1-5. In a particular embodiment, the computer 
comprises: (i) a machine-readable data storage material encoded with machine- 
readable data, (ii) a working memory for storing instructions for processing the 

25 machine readable data, (iii) a central processing unit coupled to the working memory 
and the machine-readable data storage material for processing the machine-readable 
data into a three-dimensional representation, and (iv) a display coupled to the central 
processing unit for displaying the three-dimensional representation. 

30 Thus the machine-readable data storage medium comprises a data storage material 
encoded with machine readable data which can comprise portions or all of the 
structural information contained in Tables 1-5. One embodiment for manipulating 
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and displaying the structural data provided by the present invention is schematically 
depicted in Figure 5. As depicted, the System 1, includes a computer 2 comprising a 
central processing unit ("CPU") 3, a working memory 4 which may be random-access 
memory or "core' 1 memory, mass storage memory 5 (e.g., one or more disk or CD- 
5 ROM drives), a display terminal 6 (e.g., a cathode-ray tube), one or more keyboards 
7, one or more input lines 10, and one or more output lines 20, all of which are 
interconnected by a conventional bidirectional system bus 30. 

Input hardware 12, coupled to the computer 2 by input lines 10, may be implemented 
10 in a variety of ways. Machine-readable data may be inputted via the use of one or 
more modems 14 connected by a telephone line or dedicated data line 16. 
Alternatively or additionally, the input hardware 12 may comprise CD-ROM or disk 
drives 5. In conjunction with the display terminal 6, the keyboard 7 may also be used 
as an input device. Output hardware 22, coupled to computer 2 by output lines 20, 
1 5 may similarly be implemented by conventional devices. Output hardware 22 may 
include a display terminal 6 for displaying the three dimensional data. Output 
hardware might also include a printer 24, so that a hard copy output may be produced, 
or a disk drive 5, to store system output for later use, see also U.S. Patent No: 
5,978,740, Issued November 2, 1999, the contents of which are hereby incorporated 
20 by reference in their entireties. 

In operation, the CPU 3 (i) coordinates the use of the various input and output devices 
12 and 22; (ii) coordinates data, accesses from mass storage 5 and accesses to and 
from working memory 4; and (iii) determines the sequence of data processing steps. 
25 Any of a number of programs may be used to process the machine-readable data of 
this invention. 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

30 



As used herein the term "SNT-1 PTB domain/FGFR peptide complex" is used 
interchangeably with the term "SNT/FGFR complex" and the term "SNT-FGFR 
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complex" are all meant to denote a binding complex between the PTB domain of SNT 
or portion thereof that binds to the FGF receptor, and a fragment of the FGF receptor 
that binds to the PTB domain of SNT. One such a complex is identified in the 
Example below. 

5 

As used herein the term "SNT/FGFR dependent cellular signaling pathway" is a 
cellular signaling pathway in which the direct interaction between SNT and the FGF 
receptor {i.e., binding) is involved in transmitting the signal from an extracellular 
ligand for the receptor to the nucleus of the cell. 

10 

A "polypeptide" comprising a fragment of FGFR or SNT (or more particularly the 
SNT PTB domain) as used herein can be the "fragment" alone, or a larger chimeric or 
fusion peptide/protein which contains the "fragment". 

15 As used herein a polypeptide or peptide "consisting essentially of or that "consists 
essentially of a specified amino acid sequence is a polypeptide or peptide that retains 
the general characteristics, e.g., activity of the polypeptide or peptide having the 
specified amino acid sequence and is otherwise identical to that protein in amino acid 
sequence except it consists of plus or minus 10% or fewer, preferably plus or minus 

20 5% or fewer, and more preferably plus or minus 2.5% or fewer amino acid residues. 

As used herein the terms "fusion protein" and "fusion peptide" are used 
interchangeably and encompass "chimeric proteins and/or chimeric peptides" and 
fusion "intein proteins/peptides". A fusion protein of the present invention can 
25 comprises a portion of an SNT or FGFR of the present invention joined via a peptide 
bond to at least a portion of another protein or peptide including a second SNT or 
FGFR protein in a chimeric fusion protein. For example fusion proteins can comprise 
a marker protein or peptide, or a protein or peptide that aids in the identification 
and/or monitoring of an SNT or FGFR peptide/protein of the present invention. 

30 

As used herein the terms "approximately" and "about" are used to signify that a value 
is within ten percent of the indicated value i.e., a protein fragment containing 
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"approximately" 140 amino acid residues can contain between 126 and 154 amino 
acid residues. 

As used herein the term "binds to" is meant to include all such specific interactions 
5 that result in two or more molecules showing a preference for one another relative to 
some third molecule. This includes processes such as covalent, ionic, hydrophobic 
and hydrogen bonding but does not include non-specific associations such solvent 
preferences. 

10 As used herein, and unless otherwise specified, the terms "agent", "potential drug", 
"test compound" or "potential compound" are used interchangeably, and refer to 
chemicals which potentially have a use as an inhibitor or activator/stabilizer of 
SNT/FGFR binding, and preferably include drugs for the treatment or prevention of a 
disease and/or condition involving the FGF receptor. Therefore, such "agents", 

15 "potential drugs", and "potential compounds" may be used, as described herein, in 
drug assays and drug screens and the like. 

As used herein a "small organic molecule" is an organic compound [or organic 
compound complexed with an inorganic compound (e.g., metal)] that has a molecular 
20 weight of less than 3 Kilodaltons. 

As used herein, the term "gene" refers to an assembly of nucleotides that encode a 
polypeptide, and includes cDNA and genomic DNA nucleic acids. 

25 A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. 
A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions 
as an autonomous unit of DNA replication in vivo, i.e., capable of replication under its 
own control. 

30 

A "cassette" refers to a segment of DNA that can be inserted into a vector at specific 
restriction sites. The segment of DNA encodes a polypeptide of interest, and the 



21 

cassette and restriction sites are designed to ensure insertion of the cassette in the 
proper reading frame for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when such DNA 
5 has been introduced inside the cell. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxy adenosine, deoxyguanosine, deoxythymidine, or 

10 deoxycytidine; "DNA molecules"), or any phosphoester analogues thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a double-stranded 
helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. 
The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only 
to the primary and secondary structure of the molecule, and does not limit it to any 

15 particular tertiary forms. Thus, this term includes double-stranded DNA found, inter 
alia, in linear or circular DNA molecules {e.g., restriction fragments), plasmids, and 
chromosomes. In discussing the structure of particular double-stranded DNA 
molecules, sequences may be described herein according to the normal convention of 
giving only the sequence in the 5 ' to 3 ' direction along the nontranscribed strand of 

20 DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a 
25 cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate 
conditions of temperature and solution ionic strength (see Sambrook et aL, supra). 
The conditions of temperature and ionic strength determine the "stringency" of the 
hybridization. For preliminary screening for homologous nucleic acids, low 
30 stringency hybridization conditions, corresponding to a T m of 55°, can be used, e.g., 
5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5x SSC, 
0.5% SDS). Moderate stringency hybridization conditions correspond to a higher T m , 



22 

e.g., 40% formamide, with 5x or 6x SCC. High stringency hybridization conditions 
correspond to the highest T m , e.g., 50% formamide, 5x or 6x SCC. Hybridization 
requires that the two nucleic acids contain complementary sequences, although 
depending on the stringency of the hybridization, mismatches between bases are 
5 possible. The appropriate stringency for hybridizing nucleic acids depends on the 
length of the nucleic acids and the degree of complementation, variables well known 
in the art. The greater the degree of similarity or homology between two nucleotide 
sequences, the greater the value of T m for hybrids of nucleic acids having those 
sequences. The relative stability (corresponding to higher T m ) of nucleic acid 

10 hybridizations decreases in the following order: RNA:RNA, DNA:RNA, DNArDNA. 
For hybrids of greater than 100 nucleotides in length, equations for calculating T m 
have been derived (see Sambrook et al, supra, 9.50-10.51). For hybridization with 
shorter nucleic acids, i.e., oligonucleotides, the position of mismatches becomes more 
important, and the length of the oligonucleotide determines its specificity (see 

15 Sambrook et al, supra, 11.7-11.8). Preferably a minimum length for a hybridizable 
nucleic acid is at least about 12 nucleotides; preferably at least about 18 nucleotides; 
and more preferably the length is at least about 27 nucleotides; and most preferably 36 
nucleotides. 

20 In a specific embodiment, the term "standard hybridization conditions" refers to a T m 
of 55 °C, and utilizes conditions as set forth above. In a preferred embodiment, the T m 
is 60 °C; in a more preferred embodiment, the T m is 65 °C. Preferably the wash and 
hybridization (binding) steps are identical. 

25 A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed 
and translated into a polypeptide in a cell in vitro or in vivo when placed under the 
control of appropriate regulatory sequences. The boundaries of the coding sequence 
are determined by a start codon at the 5 ' (amino) terminus and a translation stop 
codon at the 3' (carboxyl) terminus. A coding sequence can include, but is not 

30 limited to, prokaryotic sequences and synthetic DNA sequences. If the coding 

sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence. 
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Transcriptional and translational control sequences are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation 
signals are control sequences. 

5 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3 ' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
10 direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined for example, by 
mapping with nuclease SI), as well as protein binding domains (consensus sequences) 
responsible for the binding of RNA polymerase. 

15 

A coding sequence is "under the control" of transcriptional and translational control 
sequences in a cell when RNA polymerase transcribes the coding sequence into 
mRNA, which is then trans-RNA spliced and translated into the protein encoded by 
the coding sequence. 

20 

A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate 
start signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining 
25 the correct reading frame to permit expression of the DNA sequence under the control 
of the expression control sequence and production of the desired product encoded by 
the DNA sequence. If a gene that one desires to insert into a recombinant DNA 
molecule does not contain an appropriate start signal, such a start signal can be 
inserted in front of the gene. 

30 

As used herein, the term "homologous" in all its grammatical forms refers to the 
relationship between proteins that possess a "common evolutionary origin," including 



24 

proteins from superfamilies (e.g., the immunoglobulin superfamily) and homologous 
proteins from different species (e.g., myosin light chain, etc.) [Reeck et al, Cell, 
50:667 (1987)]. Such proteins have sequence homology as reflected by their high 
degree of sequence similarity. 

5 

In a specific embodiment, two DNA sequences are "substantially homologous" or 
"substantially similar" when at least about 50% (preferably at least about 75%, and 
most preferably at least about 90 or 95%) of the nucleotides match over the defined 
length of the DNA sequences. Sequences that are substantially homologous can be 
10 identified by comparing the sequences using standard software available in sequence 
data banks, or in a Southern hybridization experiment under, for example, stringent 
conditions as defined for that particular system. Defining appropriate hybridization 
conditions is within the skill of the art. See, e.g., Maniatis et al, supra; DNA 
Cloning, Vols. I & II, supra; Nucleic Acid Hybridization, supra, 

15 

Similarly, in a particular embodiment, two amino acid sequences are "substantially 
homologous" or "substantially similar" when greater than 25% of the amino acids are 
identical (preferably at least about 50%, more preferably at least about 75%, and most 
preferably at least about 90 or 95% identical), or greater than about 60% (preferably at 

20 least about 75%, more preferably at least about 90%, and most preferably at least 

about 95 or 100%) are functionally identical. The sequence comparison is performed 
over a contiguous block of amino acid residues comprised by SNT, for example. In a 
preferred embodiment selected deletions or insertions that could otherwise alter the 
correspondence between the two amino acid sequences are taken into account. 

25 Preferably standard computer analysis is employed for the determination that is 
comparable, (or identical) to that determined with an Advanced Blast search at 
www.ncbi.nlm.nih.gov under the default filter conditions [e.g., using the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, 
Madison, Wisconsin) pileup program using the default parameters]. 

30 

As used herein an amino acid sequence is 100% "homologous" to a second amino 
acid sequence if the two amino acid sequences are identical, and/or differ only by 
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neutral or conservative substitutions as defined below. Accordingly, an amino acid 
sequence is 50% "homologous" to a second amino acid sequence if 50% of the two 
amino acid sequences are identical, and/or differ only by neutral or conservative 
substitutions. 

5 

The term "corresponding to" is used herein to refer similar or homologous sequences, 
whether the exact position is identical or different from the molecule to which the 
similarity or homology is measured. Thus, the term "corresponding to" refers to the 
sequence similarity, and not the numbering of the amino acid residues or nucleotide 
10 bases. 

As used herein a "heterologous nucleotide sequence" is a nucleotide sequence that is 
added to a nucleotide sequence of the present invention by recombinant methods to 
form a nucleic acid which is not naturally formed in nature. Such nucleic acids can 

15 encode fusion proteins or peptides, including chimeric proteins and peptides. Thus 
the heterologous nucleotide sequence can encode peptides and/or proteins which 
contain regulatory and/or structural properties. In another such embodiment the 
heterologous nucleotide can encode a protein or peptide that functions as a means of 
detecting the protein or peptide encoded by the nucleotide sequence of the present 

20 invention after the recombinant nucleic acid is expressed. In still another such 
embodiment the heterologous nucleotide can function as a means of detecting a 
nucleotide sequence of the present invention. A heterologous nucleotide sequence 
can comprise non-coding sequences including restriction sites, regulatory sites, 
promoters and the like. 

25 

General Techniques for Constructing Nucleic Acids That Express the SNT or FGFR 
Fragments of the Present Invention. 

In accordance with the present invention there may be employed conventional 
30 molecular biology, microbiology, and recombinant DNA techniques within the skill 
of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook, 
Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition 
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(1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (herein 
"Sambrook et aL, 1989"); DNA Cloning: A Practical Approach, Volumes I and II 
(Glover ed. 1985); Oligonucleotide Synthesis (MJ. Gait ed. 1984); Nucleic Acid 
Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; Transcription And 
5 Translation [B.D. Hames & S J. Higgins, eds. (1984)]; Animal Cell Culture [R.I. 

Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; B. Perbal, 
A Practical Guide To Molecular Cloning (1984); F.M. Ausubel et al. (eds.), Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

10 The present invention also relates to cloning vectors containing nucleic acids 

encoding analogs and derivatives of the SNT and FGFR fragments of the present 
invention, including modified fragments, that have the same or homologous 
functional activity as the individual fragments, and homologs thereof. The production 
and use of derivatives and analogs related to the fragments are within the scope of the 

1 5 present invention. 

Due to the degeneracy of nucleotide coding sequences, other DNA sequences which 
encode substantially the same amino acid sequence as a nucleic acid encoding an 
FGFR fragment or peptide, or SNT PTB domain of the present invention may be used 

20 in the practice of the present invention. These include but are not limited to allelic 
genes, homologous genes from other species, which are altered by the substitution of 
different codons that encode the same amino acid residue within the sequence, thus 
producing a silent change. Likewise, the FGFR fragments or peptides, or SNT PTB 
domains of the invention include, but are not limited to, those containing, as a primary 

25 amino acid sequence, analogous portions of their respective amino acid sequences 
including altered sequences in which functionally equivalent amino acid residues are 
substituted for residues within the sequence resulting in a conservative amino acid 
substitution. For example, one or more amino acid residues within the sequence can 
be substituted by another amino acid of a similar polarity, which acts as a functional 

30 equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, 



leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. 
Amino acids containing aromatic ring structures are phenylalanine, tryptophan, and 
tyrosine. The polar neutral amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids 
5 include arginine, and lysine. The negatively charged (acidic) amino acids include 
aspartic acid and glutamic acid. 

Particularly preferred conserved amino acid exchanges are: 
(a) Lys for Arg or vice versa such that a positive charge may be maintained; 
10 (b) GluforAsp or vice versa such that a negative charge may be maintained; 

(c) Ser for Thr or vice versa such that a free -OH can be maintained; 

(d) Gin for Asn or vice versa such that a free NH 2 can be maintained; 

(e) He for Leu or for Val or vice versa as roughly equivalent hydrophobic amino 
acids; and 

15 (f) Phe for Tyr or vice versa as roughly equivalent aromatic amino acids. 

A conservative change generally leads to less change in the structure and function of 
the resulting protein. A non-conservative change is more likely to alter the structure, 
activity or function of the resulting protein. The present invention should be 
20 considered to include sequences containing conservative changes which do not 
significantly alter the activity or binding characteristics of the resulting protein. 
Specific amino acid residues for both the SNT and the FGFR fragments have been 
identified that are important for binding, indicating a lower stringency for the 
substitution of the remaining amino acids residues. 

25 

All of the FGFR and SNT PTB domain peptides/fragments of the present invention 
can be modified by being placed in a fusion or chimeric peptide or protein, or labeled 
e.g., to have an N-terminal FLAG-tag, or His6 tag. In a particular embodiment the 
SNT PTB domain fragment can be modified to contain a marker protein such as green 
30 fluorescent protein as described in U.S. Patent No. 5,625,048 filed April 29, 1997 and 
WO 97/26333, published July 24, 1997 each of which are hereby incorporated by 
reference herein in their entireties. 
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The nucleic acids encoding peptides and protein fragments of the present invention 
and analogs thereof can be produced by various methods known in the art. The 
manipulations which result in their production can occur at the gene or protein level 
[Sambrook et al 9 1989, supra]. The nucleotide sequence can be cleaved at 
5 appropriate sites with restriction endonuclease(s), followed by further enzymatic 
modification if desired, isolated, and ligated in vitro. In addition a nucleic acid 
sequence can be mutated in vitro or in vivo, to create and/or destroy translation, 
initiation, and/or termination sequences, or to create variations in coding regions 
and/or form new restriction endonuclease sites or destroy preexisting ones, to . 

10 facilitate further in vitro modification. Any technique for mutagenesis known in the 
art can be used, including but not limited to, in vitro site-directed mutagenesis 
[Hutchinson et al. 9 J. Biol. Chem., 253:6551 (1978); Zoller and Smith, DNA, 3:479- 
488 (1984); Oliphant et ai, Gene, 44:177 (1986); Hutchinson et al.,Proc. Natl Acad. 
ScL U.S.A., 83:710 (1986)], use of TAB® linkers (Pharmacia), etc. PCR techniques 

15 are preferred for site directed mutagenesis [see Higuchi, 1989, "Using PCR to 
Engineer DNA", in PCR Technology: Principles and Applications for DNA 
Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70]. 

The identified and isolated nucleic acids can then be inserted into an appropriate 
20 cloning vector. A large number of vector-host systems known in the art may be used. 

Protein expression and purification 
A bacterial protein expression system can be used to make various stable isotopically 
labeled ( 13 C, 15 N, and 2 H) protein samples that are useful for a three-dimensional 
25 NMR structural determination of a protein complex. For example a pETl 5b 
(Novagen) bacterial expression vector can be constructed which expresses the 
recombinant SNT-1 PTB domain as an amino-terminal His-tagged fusion protein. 

Protein expression and purification can be conducted using standard procedures for 
30 His-tagged proteins [Zhou et ai, J. Biol. Chem. 270:311 19-31123 (1995)]. To 
optimize the level of protein expression, various bacterial growth and expression 
conditions can be screened, which include different E. Coli cell lines, and growth and 
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protein induction temperatures. Generally, it is preferred to obtain the maximum 
amount of soluble protein while still inducing protein expression with a relatively low 
IPTG concentration e.g., ~0.2 mM (final concentration) at 16 °C. Under these 
conditions, reasonable quantities of protein were obtained (5-10 mg/ml of soluble 
5 protein) using the His-tagged plasmid transformed in E. Coli BL21(DE3) cells grown 
in M9 minimal medium. The N-terminal His-tag can be readily cleaved with the 
treatment of protease thrombin in a 20 mM Tris buffer of pH 8.0 containing 200 mM 
NaCl and 5 mM (3-Mercapto-ethanoL 

10 One major advantage of using the heteronuclear multidimensional approach, as 

exemplified herein, is that the NMR resonance assignments of a protein are obtained 
in a sequence-specific manner which assures accuracy and greatly facilitates data 
analysis and structure determination [Clore, and Gronenborn, Meth. Enzymol. 
239:249-363 (1994)]. In addition, the signal overlapping problems in the protein 

1 5 spectra are minimized by the use of multidimensional NMR spectra, which separates 
the proton signals according to the chemical shifts of their attached hetero-nuclei 
(such as 15 N and 13 C). This NMR approach has been proven very powerful for 
structural analysis of large proteins [Clore and Gronenborn, Meth. Enzymol. 239:249- 
363 (1994)]. To facilitate sequence-specific resonance assignments for the structural 

20 study, a uniformly 13 C, 15 N-labeled and fractionally (75%) deuterated protein sample 
of the SNT-1 PTB domain can be prepared by growing bacterial cells in 75% 2 H 2 0. 
Such protein samples can be used for triple-resonance NMR experiments. A triple- 
labeled protein sample is useful for high-resolution NMR structural studies. Because 
of the favorable *H, 13 C, and 15 N relaxation rates caused by the partial deuteration of 

25 the protein, constant-time triple-resonance NMR spectra can be acquired with higher 
digital resolution and sensitivity [Sattler and Fesik, Structure 4:1245-1249 (1996)]. 
In addition, various stable-isotopically labeled ( 15 N and 13 C / 15 N) proteins can also be 
prepared using this procedure. 
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Synthetic Polypeptides 
The term "polypeptide" is used in its broadest sense to refer to a compound of two or 
more subunit amino acids, amino acid analogs, or peptidomimetics. The subunits are 
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linked by peptide bonds. The FGFR peptides of the present invention may be 
chemically synthesized. 

In addition, potential drugs or agents that may be tested in the drug screening assays 
5 of the present invention may also be chemically synthesized. Synthetic polypeptides, 
prepared using the well known techniques of solid phase, liquid phase, or peptide 
condensation techniques, or any combination thereof, can include natural and 
unnatural amino acids. Amino acids used for peptide synthesis may be standard Boc 
(N a -amino protected N a -t-butyloxycarbonyl) amino acid resin with the standard 

10 deprotecting, neutralization, coupling and wash protocols of the original solid phase 
procedure of Merrifield [J. Am. Chem. Soc, 85:2149-2154 (1963)], or the base-labile 
N a -amino protected 9-fluorenylmethoxycarbonyl (Fmoc) amino acids first described 
by Carpino and Han [J. Org. Chem., 37:3403-3409 (1972)]. Both Fmoc and Boc N a - 
amino protected amino acids can be obtained from Fluka, Bachem, Advanced 

15 Chemtech, Sigma, Cambridge Research Biochemical, Bachem, or Peninsula Labs or 
other chemical companies familiar to those who practice this art. In addition, the 
method of the invention can be used with other N a -protecting groups that are familiar 
to those skilled in this art. Solid phase peptide synthesis may be accomplished by 
techniques familiar to those in the art and provided, for example, in Stewart and 

20 Young [Solid Phase Synthesis, Second Edition, Pierce Chemical Co., Rockford, IL 

(1984)] and Fields and Noble [Int. J. Pept. Protein Res., 35:161-214 (1990)], or using 
automated synthesizers, such as sold by ABS. Thus, polypeptides of the invention 
may comprise D-amino acids, a combination of D- and L-amino acids, and various 
"designer" amino acids (e.g., (3-methyl amino acids, Ca-methyl amino acids, and Na- 

25 methyl amino acids, etc.) to convey special properties. Synthetic amino acids include 
ornithine for lysine, fluorophenylalanine for phenylalanine, and norleucine for leucine 
or isoleucine. Additionally, by assigning specific amino acids at specific coupling 
steps, a-helices, (3 turns, p sheets, y-turns, and cyclic peptides can be generated. 

30 In a further embodiment, subunits of peptides that confer useful chemical and 

structural properties will be chosen. For example, peptides comprising D-amino acids 
will be resistant to L-amino acid-specific proteases in vivo. In addition, the present 
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invention envisions preparing peptides that have more well defined structural 
properties, and the use of peptidomimetics, and peptidomimetic bonds, such as ester 
bonds, to prepare peptides with novel properties. In another embodiment, a peptide 
may be generated that incorporates a reduced peptide bond, i.e., R r CH 2 -NH-R 2 , 
5 where R t and R 2 are amino acid residues or sequences. A reduced peptide bond may 
be introduced as a dipeptide subunit. Such a molecule would be resistant to peptide 
bond hydrolysis, e.g. , protease activity. Such peptides would provide ligands with 
unique function and activity, such as extended half-lives in vivo due to resistance to 
metabolic breakdown, or protease activity. Furthermore, it is well known that in 
10 certain systems constrained peptides show enhanced functional activity [Hruby, Life 
Sciences, 31:189-199 (1982); Hruby et al 9 Biochem J., 268:249-262 (1990)]; the 
present invention provides a method to produce a constrained peptide that 
incorporates random sequences at all other positions. 

15 Constrained and cyclic peptides. A constrained, cyclic or rigidized peptide may be 
prepared synthetically, provided that in at least two positions in the sequence of the 
peptide an amino acid or amino acid analog is inserted that provides a chemical 
functional group capable of crosslinking to constrain, cyclise or rigidize the peptide 
after treatment to form the crosslink. Cyclization will be favored when a turn- 

20 inducing amino acid is incorporated. Examples of amino acids capable of 

crosslinking a peptide are cysteine to form disulfides, aspartic acid to form a lactone 
or a lactam, and a chelator such as y-carboxyl-glutamic acid (Gla) (Bachem) to chelate 
a transition metal and form a cross-link. Protected y-carboxyl glutamic acid may be 
prepared by modifying the synthesis described by Zee-Cheng and Olson [Biophys. 

25 Biochem. Res. Commun., 94:1 128-1 132 (1980)]. A peptide in which the peptide 

sequence comprises at least two amino acids capable of crosslinking may be treated, 
e.g., by oxidation of cysteine residues to form a disulfide or addition of a metal ion to 
form a chelate, so as to crosslink the peptide and form a constrained, cyclic or 
rigidized peptide. 

30 

The present invention provides strategies to systematically prepare cross-links. For 
example, if four cysteine residues are incorporated in the peptide sequence, different 
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protecting groups may be used (Hiskey, in The Peptides: Analysis, Synthesis, 
Biology, Vol. 3, Gross and Meienhofer, eds., Academic Press: New York, pp. 137- 
167 (1981); Ponsanti et al., Tetrahedron, 46:8255-8266 (1990)]. The first pair of 
cysteines may be deprotected and oxidized, then the second set may be deprotected 
5 and oxidized. In this way a defined set of disulfide cross-links may be formed. 
Alternatively, a pair of cysteines and a pair of chelating amino acid analogs may be 
incorporated so that the cross-links are of a different chemical nature. 

Non-classical amino acids that induce conformational constraints. The following 
1 0 non-classical amino acids may be incorporated in the peptide in order to introduce 
particular conformational motifs: l,2,3,4-tetrahydroisoquinoline-3-carboxylate 
[Kazmierski etal.,J. Am. Chem. Soc, 113:2275-2283 (1991)]; (2S,3S)-methyl- 
phenylalanine, (2S,3R)-methyl-phenylalanine, (2R,3S)-methyl-phenylalanine and 
(2R,3R)-methyl-phenylalanine (Kazmierski and Hruby, Tetrahedron Lett. (1991)]; 2- 
15 aminotetrahydronaphthalene-2-carboxylic acid [Landis, Ph.D. Thesis, University of 
Arizona (1989)]; hydroxy- 1, 2, 3,4-tetrahydroisoquinoline-3-carboxylate [Miyake et 
al, J. Takeda Res. Labs., 43:53-76 (1989)]; (3-carboline (D and L) [Kazmierski, Ph.D. 
Thesis, University of Arizona (1988)]; HIC (histidine isoquinoline carboxylic acid) 
[Zechel et al, Int. J. Pep. Protein Res., 43 (1991)]; and HIC (histidine cyclic urea) 
20 (Dharanipragada). 

The following amino acid analogs and peptidomimetics may be incorporated into a 
peptide to induce or favor specific secondary structures: LL-Acp (LL-3-amino- 
2-propenidone-6-carboxylic acid), a (3-turn inducing dipeptide analog [Kemp et ah, J. 

25 Org. Chem., 50:5834-5838 (1985)]; p-sheet inducing analogs [Kemp et al, 

Tetrahedron Lett, 29:5081-5082 (1988); (3-turn inducing analogs [Kemp etaL, 
Tetrahedron Lett, 29:5057-5060 (1988)]; <*-helix inducing analogs (Kemp et al. 9 
Tetrahedron Lett., 29:4935-4938 (1988)]; y-turn inducing analogs [Kemp et ai, J. 
Org. Chem., 54:109:115 (1989)]; and analogs provided by the following references: 

30 Nagai and Sato, Tetrahedron Lett., 26:647-650 (1985); DiMaio et al 9 J. Chem. Soc. 
Per/cin Trans., p. 1687 (1989); also a Gly-Ala turn analog [Kahn et al. 9 Tetrahedron 
Lett., 30:2317 (1989)]; amide bond isostere [Jones et al. 9 Tetrahedron Lett., 29:3853- 
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3856 (1988)]; tretrazol [Zabrocki et al.,J. Am. Chem. Soc, 110:5875-5880 (1988)]; 
DTC [Samanen et al., Int. J. Protein Pep. Res., 35:501:509 (1990)]; and analogs 
taught in Olson et al, J. Am. Chem. Set, 112:323-333 (1990) and Garvey et al.,J. 
Org. Chem., 56:436 (1990). Conformational^/ restricted mimetics of beta turns and 
5 beta bulges, and peptides containing them, are described in U.S. Patent No. 5,440,013, 
issued August 8, 1995 to Kahn. 

Structure-based Mutation Analysis 

10 Protein structural analysis using NMR spectroscopy has several unique advantages. 
In addition to high-resolution three-dimensional structural information, the chemical 
shift assignments for the protein obtained in the structural study further provides a 
map of the entire protein at the atomic level, which can be used for structure-based 
biochemical analysis of protein-protein interactions. For example, the information 

15 generated from the NMR structural analysis can also serve to identify specific amino 
acid residues in the peptide-binding site for complementary mutagenesis studies. 
Specific focus can be placed on those residues that display long-range NOEs 
(particularly the side-chain NOEs in the 13 C-NOESY data) between the PTB domain 
and the FGFR peptides, for example. 

20 

To ensure mutant proteins are valid for functional analysis, it can be determined as to 
whether a mutation results in any significant perturbation of the overall conformation 
of the PTB domain, particularly the effects of mutation on the peptide binding sites. 
NMR spectroscopy is a powerful method for examining the effects of such a mutation 

25 on the conformation of the protein. One can readily obtain information about the 
global conformation of a mutant protein from the proton ( ] H) ID spectrum, by 
examining the chemical shift dispersion and peak line-width of NMR signals of 
amide, aromatic and aliphatic protons. Moreover, 2D ] H- 15 N HSQC spectra reveal 
details of the effects of a mutation on both local and global conformation of the 

30 protein, since every single 'H/^N signal (both the chemical shift and line-shape) in the 
NMR spectrum is a "reporter" for a particular amino acid residue. Thus, to assess 
how mutations effect protein stability and the overall protein conformation, the 15 N 
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HSQC spectra of mutated proteins can be compared to that of the wild-type protein. 

Chemical-shift perturbations due to ligand binding have proven to be a reliable and 
sensitive probe for the ligand binding site of the protein. This is because the 
5 chemical-shift changes of the backbone amide groups are likely to reflect any changes 
in protein conformation and/or hydrogen bonding due to the peptide/ligand binding. 
To examine the effects of a mutation on the ligand binding (in this case the ligand is a 
peptide), peptide titration experiments can be conducted by following the changes of 
'H/^N signals of the mutant proteins as a function of the peptide concentration. These 

10 experiments indicate whether the peptide-binding site remains the same or changes in 
the mutants relative to the wild type protein. The effects of the mutation on the 
peptide binding affinity can also be examined by NMR spectroscopy. Due to the high 
binding affinity of the peptides for the wild type SNT-1 PTB domain, signals of the 
free and peptide-bound forms exhibit a slow exchange on the NMR time scale during 

1 5 the peptide titration. If the mutated proteins result in the reduction of the binding 

affinity, a change of the exchange phenomenon between the free and the ligand-bound 
signals should be observed in NMR spectrum. If the reduction in binding affinity 
causes the peptide binding to change from a slow exchange rate to a fast exchange 
rate, on the NMR time scale, then the peptide binding affinity can be determined from 

20 the NMR titration experiment. From these mutation analyses key amino acid residues 
that are important for binding either the tyrosine-phosphorylated TRKA receptor 
peptide or the non-phosphorylated FGF receptor peptide can be identified. 

Protein Structure Determination by NMR Spectroscopy 
25 The NMR results from the present invention are summarized in Table 1, which 

includes exemplary structural coordinates, Table 2, which contains the chemical shift 
data, Table 3, which contains the Hbond results, Table 4, which contains the 
unambiguous determinations, and Table 5, which contains the ambiguous 
determinations. 

30 

Backbone and Side-chain Assignments: Sequence-specific backbone assignment is 
achieved by using a suite of deuterium-decoupled triple-resonance 3D NMR 
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experiments which include HNCA, HN(CO)CA, HN(CA)CB, HN(COCA)CB, 
HNCO, and HN(CA)CO experiments [Yamazaki, et aL, J. Am. Chem. Soc. 
116:1 1655-11666 (1994)]. The water flip-back scheme is used in these NMR pulse 
programs to minimize amide signal attenuation from water exchange. Sequential 
5 side-chain assignments are typically accomplished from a series of 3D NMR 
experiments with alternative approaches to confirm the assignments. These 
experiments include 3D ,5 N TOCSY-HSQC, HCCH-TOCSY, (H)C(CO)NH- 
TOCSY, and H(C)(CO)NH-TOCSY [see Clore and Gronenborn, Meth. EnzymoL 
239:249-363 (1994); Sattler et aL, Prog, in Nuclear Magnetic Resonance Spec. 4:93- 
10 158 (1999)]. 

Stereospecific Methyl Groups: Stereospecific assignments of methyl groups of Valine 
and Leucine residues are obtained from an analysis of carbon signal multiplet splitting 
using a fractionally 13 C-labeled protein sample, which can be readily prepared using 
15 M9 minimal medium containing 10% I3 C-/90% 12 C-glucose mixture [see Neri, et aL, 
Biochemistry 28:7510-7516 (1989)]. 

Dihedral Angle Restraints: Backbone dihedral angle (<£>) constraints are generated 
from the V HNHa coupling constants measured in a HNHA- J experiment [see Vuister, 
20 G. & Bax, A. J. Am. Chem. Soc. 115:7772-7777 (1993)]. Side-chain dihedral angles 
(xl) can be obtained from short mixing time 15 N-edited 3D TOCSY-HSQC [see Clore, 
et aL, J f Biomol NMR 1:13-22 (1991)] and 3D HNHB experiments [see Matson et aL, 
J. Biomol. NMR 3:239-244 (1993)], which can also provide stereospecific 
assignments of P methylene protons. 

25 

Hydrogen Bonds Restraints: Amide protons that are involved in hydrogen bonds can 
be identified from an analysis of amide exchange rates measured from a series of 2D 
'H/^N HSQC spectra recorded after adding 2 H 2 0 to the protein sample. 

30 NOE Distance Restraints: Distance restraints are obtained from analysis of 15 N 3 and 
I3 C-edited 3D NOESY data, which can be collected with different mixing times to 
minimize spin diffusion problems. The nuclear Overhauser effect (NOE)-derived 
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restraints are categorized as strong (1.8-3 A), medium (1.8-4 A) or weak (L8-5 A) 
based on the observed NOE intensities. A recently developed procedure for the 
iterative automated NOE analysis by using ARIA [see Nilges et al. , Prog. NMR 
Spectroscopy 32:107-139 (1998)] can be employed which integrates with X-PLOR for 
5 structural calculations. To ensure the success of ARIA/X-PLOR- assisted NOE 

analysis and structure calculations, the ARIA assigned NOE peaks can be manually 
confirmed. 

Intermolecular NOE Distance Restrains: For the structural determination of a 
1 0 protein/peptide complex, intermolecular NOE distance restraints can be obtained from 
a 13 C-edited (F,) and 15 N, and 13 C-filtered (F 3 ) 3D NOESY data set collected for a 
sample containing isotope-labeled protein and non-labeled peptide. 

Structure Calculations and Refinements'. Structures of the protein can be generated 
1 5 using a distance geometry/simulated annealing protocol with the X-PLOR program 
[see Nilges et al., FEES Lett 229:317-324 (1988); Kuszewski, et al, J. Biolmol NMR 
2:33-56 (1992); Briinger, X-PLOR Version 3 J: A system for X-Ray crystallography 
and NMR (Yale University Press, New Haven, CT, 1993)]. The structure calculations 
can employ inter-proton distance restraints obtained from 15 N- and 13 C-resolved 
20 NOESY spectra. The initial low-resolution structures can be used to facilitate NOE 
assignments, and help identify hydrogen bonding partners for slowly exchanging 
amide protons. The experimental restraints of dihedral angles and hydrogen bonds can 
be included in the distance restraints for structure refinements. 

25 Protein-Structure Based Design of Agonists and 

Antagonists of The SNT/FGFR Complex 

Once the three-dimensional structure of the SNT/FGFR complex is determined, a 
potential drug or agent (antagonist or agonist) can be examined through the use of 
30 computer modeling using a docking program such as GRAM, DOCK, or 

AUTODOCK [Dunbrack et al, 1997, supra]. This procedure can include computer 
fitting of potential agents to the PTB domain of SNT-1, for example, to ascertain how 
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well the shape and the chemical structure of the potential ligand will complement or 
interfere with the interaction between SNT-1 and FGFR [Bugg et al, Scientific 
American, Dec.:92-98 (1993); West et al, TIPS, 16:67-74 (1995)]. Computer 
programs can also be employed to estimate the attraction, repulsion, and steric 
5 hindrance of the agent to the dimer-dimer binding site, for example. Generally the 
tighter the fit (e.g., the lower the steric hindrance, and/or the greater the attractive 
force) the more potent the potential drug will be since these properties are consistent 
with a tighter binding constant. Furthermore, the more specificity in the design of a 
potential drug the more likely that the drug will not interfere with related proteins. 
10 This will minimize potential side-effects due to unwanted interactions with other 
proteins. 

Initially a potential drug could be obtained by screening a random peptide library 
produced by recombinant bacteriophage for example, [Scott and Smith, Science, 

15 249:386-390 (1990); Cwirla et al, Proc. Natl Acad. ScL, 87:6378-6382 (1990); 

Devlin et al, Science, 249:404-406 (1990)] or a chemical library. An agent selected 
in this manner could be then be systematically modified by computer modeling 
programs until one or more promising potential drugs are identified. Such analysis 
has been shown to be effective in the development of HIV protease inhibitors [Lam et 

20 al, Science 263:380-384 (1994); Wlodawer et al, Ann. Rev. Biochem. 62:543-585 

(1993); Appelt, Perspectives in Drug Discovery and Design 1:23-48 (1993); Erickson, 
Perspectives in Drug Discovery and Design 1:109-128 (1993)]. 

Such computer modeling allows the selection of a finite number of rational chemical 
25 modifications, as opposed to the countless number of essentially random chemical 
modifications that could be made, any of which any one might lead to a useful drug. 
Each chemical modification requires additional chemical steps, which while being 
reasonable for the synthesis of a finite number of compounds, quickly becomes 
overwhelming if all possible modifications needed to be synthesized. Thus through 
30 the use of the three-dimensional structural analysis disclosed herein and computer 
modeling,' a large number of these compounds can be rapidly screened on the 
computer monitor screen, and a few likely candidates can be determined without the 
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laborious synthesis of untold numbers of compounds. 

Once a potential drug (agonist or antagonist) is identified it can be either selected 
from a library of chemicals as are commercially available from most large chemical 
5 companies including Merck, Glaxo Welcome, Bristol Meyers Squib, Monsanto/Searle, 
Eli Lilly, Novartis and Pharmacia UpJohn, or alternatively the potential drug may be 
synthesized de novo. As mentioned above, the de novo synthesis of one or even a 
relatively small group of specific compounds is reasonable in the art of drug design. 

10 The potential drug can then be tested in any standard binding assay (including in high 
throughput binding assays) for its ability to bind to SNT or fragment thereof 
comprising the PTB domain. Alternatively the potential drug can be tested for its 
ability to modulate (either inhibit or stimulate) a cellular signal that is dependent on 
the interaction of SNT with FGFR. When a suitable potential drug is identified, a 

1 5 second NMR structural analysis can optionally be performed on the binding complex 
formed between the SNT/FGFR complex and the potential drug. Computer programs 
that can be used to aid in solving such three-dimensional structures include 
QUANTA, CHARMM, INSIGHT, SYBYL, MACROMODE, and ICM, MOLMOL, 
RASMOL, AND GRASP [Kraulis, J. Appl Crystaltogr. 24:946-950 (1991)]. Most if 

20 not all of these programs and others as well can be also obtained from the 
Worldwide Web through the internet. 

Using the approach described herein and equipped with the structural analysis 
disclosed herein, the three-dimensional structures of other SNT/receptor complexes 
25 can more readily be obtained and analyzed. Such analysis will, in turn, allow 
corresponding drug screening methodology to be performed using the three- 
dimensional structures of such related complexes. 

For all of the drug screening assays described herein further refinements to the 
30 structure of the drug will generally be necessary and can be made by the successive 
iterations of any and/or all of the steps provided by the particular drug screening 
assay, including further structural analysis by NMR, for example. 
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Phage libraries for Drug Screening. 

Phage libraries have been constructed which when infected into host E. coli produce 
random peptide sequences of approximately 10 to 15 amino acids [Parmley and 
Smith, Gene 73:305-318 (1988), Scott and Smith, Science 249:386-249 (1990)]. 
5 Specifically, the phage library can be mixed in low dilutions with permissive E. coli 
in low melting point LB agar which is then poured on top of LB agar plates. After 
incubating the plates at 37°C for a period of time, small clear plaques in a lawn of E. 
coli will form which represents active phage growth and lysis of the E. coli. A 
representative of these phages can be absorbed to nylon filters by placing dry filters 

1 0 onto the agar plates. The filters can be marked for orientation, removed, and placed in 
washing solutions to block any remaining absorbent sites. The filters can then be 
placed in a solution containing, for example, a radioactive PTB domain (e.g., the 
peptide comprising amino acid residues 1 1-140 of SEQ ID NO:l). After a specified 
incubation period, the filters can be thoroughly washed and developed for 

1 5 autoradiography. Plaques containing the phage that bind to the radioactive PTB 

domain can then be identified. These phages can be further cloned and then retested 
for their ability to bind to the PTB domain as before. Once the phage has been 
purified, the binding sequence contained within the phage can be determined by 
standard DNA sequencing techniques. Once the DNA sequence is known, synthetic 

20 peptides can be generated which are encoded by these sequences. 

These peptides can be tested, for example, for their ability to e.g., interfere with the 
binding of SNT with FGFR. 

25 The effective peptide(s) can be synthesized in large quantities for use in in vivo 
models and eventually in humans to treat certain tumors. It should be emphasized 
that synthetic peptide production is relatively non-labor intensive, easily 
manufactured, quality controlled and thus, large quantities of the desired product can 
be produced quite cheaply. Similar combinations of mass produced synthetic peptides 

30 have recently been used with great success [Patarroyo, Vaccine, 10:175-178 (1990)]. 
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Drug Screening Assays 
The drug screening assays of the present invention may use any of a number of means 
for determining the interaction between an agent or drug and SNT and/or FGFR. 
In one such assay, a drug can be specifically designed to bind to the PTB domain of 
5 SNT-1 through NMR based methodology. [Shuker et al 9 Science 274:1531-1534 
(1996) hereby incorporated by reference in its entirety.] In a particular embodiment, 
analogs of the FGFR derived peptide having the amino acid sequence of SEQ ID 
NO:3 are used. In a particular embodiment of this type, the peptide has the amino 
acid sequence of SEQ ID NO:4. In another such embodiment of this type, the peptide 
1 0 has the amino acid sequence of SEQ ID NO:5. Alternatively, a library of low 

molecular weight compounds can be screened to identify a binding partner for the 
PTB domain. Any such chemical library can be used including those discussed 
above. 

15 The assay begins with contacting a compound with a 15 N-labeled SNT PTB domain. 
Binding of the compound with the SNT PTB domain can be determined by 
monitoring the 15 N- or ^-amide chemical shift changes in two dimensional 15 N- 
heteronuclear single-quantum correlation ( 15 N-HSQC) spectra upon the addition of the 
compound to the 15 N-labeled SNT PTB domain. Since these spectra can be rapidly 

20 obtained, it is feasible to screen a large number of compounds [Shuker et aL, Science 
274:1531-1534 (1996)]. A compound is identified as a potential ligand if it binds to 
the SNT PTB domain. In a further embodiment, the potential ligand can then be used 
as a model structure, and analogs to the compound can be obtained (e.g., from the vast 
chemical libraries commercially available, or alternatively through de novo synthesis). 

25 The analogs are then screened for their ability to bind the SNT PTB domain to obtain 
a ligand. An analog of the potential ligand is chosen as a ligand when it binds to the 
SNT PTB domain with a higher binding affinity than the potential ligand. In a 
preferred embodiment of this type the analogs are screened by monitoring the l5 N- or 
^-amide chemical shift changes in two dimensional l5 N-heteronuclear single- 

30 quantum correlation ( 15 N-HSQC) spectra upon the addition of the analog to the 15 N- 
labeled SNT domain as described above. 
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In another further embodiment, compounds are screened for binding to two nearby 
sites on an SNT PTB domain. In this case, a compound that binds a first site of the 
SNT PTB domain does not bind a second nearby site. Binding to the second site can 
be determined by monitoring changes in a different set of amide chemical shifts in 
5 either the original screen or a second screen conducted in the presence of a ligand (or 
potential ligand) for the first site. From an analysis of the chemical shift changes the 
approximate location of a potential ligand for the second site is identified. 
Optimization of the second ligand for binding to the site is then carried out by 
screening structurally related compounds (e.g., analogs as described above). When 

10 ligands for the first site and the second site are identified, their location and 

orientation in the ternary complex can be determined experimentally either by NMR 
spectroscopy or X-ray crystallography. On the basis of this structural information, a 
linked compound is synthesized in which the ligand for the first site and the ligand for 
the second site are linked. In a preferred embodiment of this type the two ligands are 

15 covalently linked. This linked compound is tested to determine if it has a higher 

binding affinity for the SNT PTB domain than either of the two indiviual ligands. A 
linked compound is selected as a ligand when it has a higher binding affinity for the 
SNT PTB domain than either of the two ligands. In a preferred embodiment the 
affinity of the linked compound with the SNT PTB domain is determined monitoring 

20 the 15 N- or *H- amide chemical shift changes in two dimensional 15 N-heteronuclear 
single-quantum correlation ( 15 N-HSQC) spectra upon the addition of the linked 
compound to the 15 N-labeled SNT PTB domain as described above. 

A larger linked compound can be constructed in an analogous manner, e.g., linking 
25 three ligands which bind to three nearby sites on the SNT PTB domain to form a 

multilinked compound that has an even higher affinity for the SNT PTB domain than 
linked compound. 

In another assay, a SNT PTB domain is placed on or coated onto a solid support. 
30 Methods for placing the peptides or proteins on the solid support are well known in 
the art and include such things as linking biotin to the protein and linking avidin to the 
solid support. An agent is allowed to equilibrate with the SNT PTB domain to test for 
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binding. Generally, the solid support is washed and agents that are retained are 
selected as potential drugs. In a particular embodiment of this type, the SNT PTB 
domain comprises amino acid residues 1 1-140 of SEQ ID NO:l . 

5 The agent may be labeled. For example, in one embodiment radiolabeled agents are 
used to measure the binding of the agent. In another embodiment the agents have 
fluorescent markers. In yet another embodiment, a Biocore chip (Pharmacia) coated 
with the SNT PTB domain is used and the change in surface conductivity can be 
measured. 

10 

In yet another embodiment, the affect of a prospective drug (a test compound or 
agent) on a SNT PTB domain is assayed in a living cell that contains or can be 
induced to contain SNT-1 and FGFR. The cell can also contain or can be constructed 
to contain one or more reporter genes, such as a heterologous gene comprising a 

1 5 nucleic acid encoding luciferase, green fluorescent protein, chloramphenicol acetyl 
transferase, and/or |3-galactosidase etc. Such reporter genes can be operably linked to 
a promoter comprising a binding site for a transcription factor under the control of the 
SNT/FGFR dependent cellular signaling pathway. Cells that naturally encode a 
FGFR and SNT-1 may be used, or alternatively a cell that is transfected with plasmids 

20 encoding the reporter proteins can be used, though care must be taken to ensure that 
the requisite participants in the SNT/FGFR dependent cellular signaling pathway are 
also present. 

Assays for detecting the reporter genes products are readily available in the literature. 

25 For example, luciferase assays can be performed according to the manufacturer's 
protocol (Promega), and P-galactosidase assays can be performed as described by 
Ausubel et al. [in Current Protocols in Molecular Biology, J. Wiley & Sons, Inc. 
(1994)]. The preparation of such plasmid containing reporter genes is now routine in 
the art, and many appropriate plasmids are now commercially available which can be 

30 readily modified for such assays. 

The prospective drug is generally tested under conditions in which the FGFR has been 
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activated. The FGF receptor can be activated by an antibody or preferably FGF. 
Alternatively, a permantively activated FGF can be constitutively expressed. In one 
embodiment, the expression of SNT-1 is constitutive. The amount (and/or activity) of 
reporter protein produced in the absence and presence of prospective drug can then be 
5 determined and compared. Prospective drugs which reduce the amount (and/or 

activity) of the reporter protein produced are candidate antagonists of the SNT/FGFR 
complex, whereas prospective drugs which increase the amount (and/or activity) of 
reporter protein produced are candidate agonists. 

10 

Labels : 

Suitable labels include enzymes, fluorophores (e.g., fluorescein isothiocyanate 
(FITC), phycoerythrin (PE), Texas red (TR), rhodamine, free or chelated lanthanide 
series salts, especially Eu 3+ , to name a few fluorophores), chromophores, 
15 radioisotopes, chelating agents, dyes, colloidal gold, latex particles, ligands (e.g., 

biotin), and chemiluminescent agents. When a control marker is employed, the same 
or different labels may be used for the test and control marker gene. 

In the instance where a radioactive label, such as the isotopes 3 H, 14 C, 32 P, 35 S, 36 C1, 
20 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 I, 131 I, and 186 Re are used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
detection may be accomplished by any of the presently utilized colorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
techniques known in the art. 

25 

Direct labels are one example of labels which can be used according to the present 
invention. A direct label has been defined as an entity, which in its natural state, is 
readily visible, either to the naked eye, or with the aid of an optical filter and/or 
applied stimulation, e.g, ultraviolet light to promote fluorescence. Among examples 
30 of colored labels, which can be used according to the present invention, include 
metallic sol particles, for example, gold sol particles such as those described by 
Leuvering (U.S. Patent 4,313,734); dye sole particles such as described by Gribnau et 
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al (U.S. Patent 4,373,932 and May et al (WO 88/08534); dyed latex such as 
described by May, supra, Snyder (EP-A 0 280 559 and 0 281 327); or dyes 
encapsulated in liposomes as described by Campbell et al. (U.S. Patent 4,703,017). 
Other direct labels include a radionucleotide, a fluorescent moiety or a luminescent 
5 moiety. In addition to these direct labeling devices, indirect labels comprising 
enzymes can also be used according to the present invention. Various types of 
enzyme linked immunoassays are well known in the art, for example, alkaline 
phosphatase and horseradish peroxidase, lysozyme, glucose-6-phosphate 
dehydrogenase, lactate dehydrogenase, urease, these and others have been discussed 
10 in detail by Eva Engvall in Enzyme Immunoassay ELISA and EMIT in Methods in 
Enzymology, 70:419-439 (1980) and in U.S. Patent 4,857,453. 

Suitable enzymes include, but are not limited to, alkaline phosphatase, p- 
galactosidase, green fluorescent protein and its derivatives, luciferase, and horseradish 
15 peroxidase. 

Other labels for use in the invention include magnetic beads or magnetic resonance 
imaging labels. 

Antibodies to the SNT and FGFR fragments 
According to the present invention, the SNT and FGFR peptides and fragments as 
produced by a recombinant source, or through chemical synthesis, or through the 
modification of these peptides and fragments; and derivatives or analogs thereof, 
including fusion proteins, may be used as an immunogen to generate antibodies that 
specifically interfere with the formation of the SNT/FGFR complex. Such antibodies 
include but are not limited to polyclonal, monoclonal, chimeric, single chain, Fab 
fragments, and a Fab expression library. 

Various procedures known in the art may be used for the production of the polyclonal 
30 antibodies. For the production of antibody, various host animals can be immunized 
by injection with the peptide having the amino acid sequence of SEQ ID NO: 3 for 
example, or a derivative (e.g., a fusion protein) thereof, including but not limited to 
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rabbits, mice, rats, sheep, goats, etc. In one embodiment, the peptide can be 
conjugated to an immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole 
limpet hemocyanin (KLH). Various adjuvants may be used to increase the 
immunological response, depending on the host species, including but not limited to 
5 Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, 

surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, 
oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG {bacille Calmette-Guerin) and Corynebacterium 
parvum. 

10 

q For preparation of monoclonal antibodies directed toward the peptides or protein 

™: fragments of the present invention, or analog, or derivative thereof, any technique that 

Ul provides for the production of antibody molecules by continuous cell lines in culture 

may be used. These include but are not limited to the hybridoma technique originally 
Tl 15 developed by Kohler and Milstein [Nature, 256:495-497 (1975)], as well as the trioma 

5 technique, the human B-cell hybridoma technique [Kozbor et al, Immunology Today, 

f7 4:72 (1983); Cote et al, Proc. Natl. Acad. Set U.S.A., 80:2026-2030 (1983)], and the 

y EBV-hybridoma technique to produce human monoclonal antibodies [Cole et al, in 

p Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)]. In 

r ~ 20 an additional embodiment of the invention, monoclonal antibodies can be produced in 

germ- free animals utilizing recent technology [PCT/US90/02545]. In fact, according 
to the invention, techniques developed for the production of "chimeric antibodies" 
[Morrison et al., J. Bacteriol, 159:870 (1984); Neuberger et al, Nature, 312:604-608 
(1984); Takeda et al, Nature, 314:452-454 (1985)] by splicing the genes from a 
25 mouse antibody molecule specific for the peptide having the amino acid sequence of 
SEQ ID NO:3 together with genes from a human antibody molecule of appropriate 
biological activity can be used; such antibodies are within the scope of this invention. 
Such human or humanized chimeric antibodies are preferred for use in therapy of 
human diseases or disorders (described infra), since the human or humanized 
30 antibodies are much less likely than xeno genie antibodies to induce an immune 
response, in particular an allergic response, themselves. 
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According to the invention, techniques described for the production of single chain 
antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 
4,946,778] can be adapted to produce specific single chain antibodies. An additional 
embodiment of the invention utilizes the techniques described for the construction of 
5 Fab expression libraries [Huse et al, Science, 246:1275-1281 (1989)] to allow rapid 
and easy identification of monoclonal Fab fragments with the desired specificity. 

Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not 
10 limited to: the F(ab') 2 fragment which can be produced by pepsin digestion of the 
antibody molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab')2 fragment, and the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent. 

15 In the production of antibodies, screening for the desired antibody can be 

accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA 
(enzyme-linked immunosorbant assay), "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion 
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, for 

20 example), western blots, precipitation reactions, agglutination assays {e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc. 
In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting 

25 binding of a secondary antibody or reagent to the primary antibody. In a further 

embodiment, the secondary antibody is labeled. Many means are known in the art for 
detecting binding in an immunoassay and are within the scope of the present 
invention. For example, to select antibodies which recognize a specific epitope of an 
FGFR, for example, one may assay generated hybridomas for a product which binds 

30 to an FGFR fragment containing such epitope and choose those which do not cross- 
react with FGFR fragments that do not include that epitope. 
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In a specific embodiment, antibodies that interfere with the formation of the 
SNT/FGFR complex can be generated. Such antibodies can be tested using the assays 
described could potentially be used in anti-tumor therapies. 



10 



The present invention may be better understood by reference to the following non- 
limiting Example, which is provided as exemplary of the invention. The following 
example is presented in order to more fully illustrate the preferred embodiments of the 
invention. It should in no way be construed, however, as limiting the broad scope of 
the invention. 

EXAMPLE 

THE THREE-DIMENSIONAL STRUCTURE OF 
THE SNT-1 PTB DOMAIN WITH THE FGF RECEPTOR 



15 Introduction 

Phosphotyrosine binding (PTB) domains represent a group of structurally related but 
functionally divergent protein modules that play an important role in regulating 
protein-phosphotyrosine, protein-phospholipids, or protein-protein interactions 
[Pawson, and Scott, Nature 278: 2075-2080 (1997); Zhou and Fesik, Prog. Biophys. 

20 Molec. Biol 64:221-235 (1995)]. The amino-terminal PTB domains of the newly 
discovered lipid- anchored docking proteins SNT-1 and SNT-2 have been shown to 
possess a unique ability to recognize both a canonical NPXp Y motif on the activated 
nerve growth factor (NGF) receptor, and a tyrosine and asparagine-free motif on the 
fibroblast growth factor (FGF) receptor [Xu etal, J. Biol Chem. 273:17987-17990 

25 (1998); Kouhara et al Cell 89:693-702 (1997); Meakin et al. 9 J. Biol Chem. 
274:9861-9870(1999)]. 

Materials and Methods 
Sample Preparation: A cDNA fragment encoding the SNT-1 PTB domain was 
cloned into a modified pET28b vector (Novagen) that produced the recombinant 
30 protein with a cleavable hexa-histidine (His 6 ) tag at the C-terminus. Uniformly l5 N- 
and 15 N/ 13 C-labeled proteins were prepared by growing Escherichia coli BL21(DE3) 
cells in a minimal medium containing I5 NH 4 C1 with or without I3 C 6 -glucose. A 
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uniformly 15 N/ 13 C-labeled and fractionally deuterated protein was prepared using 
medium with 75% 2 H 2 0. The protein was over-expressed largely in soluble form and 
purified by affinity chromatography on a nickel-IDA column (Invitrogen) followed by 
cleavage of the His 6 tag upon thrombin treatment. The cleaved protein contains an 
5 additional LVPR sequence at the C-terminus from the engineered thrombin site. The 
protein was unstable in its free form and quickly aggregated or became partially 
unfolded at room temperature. To ensure structural integrity, the protein was further 
subjected to a protein refolding procedure followed by ion-exchange chromatography. 
Synthetic peptides were prepared on a MilliGen 9050 peptide synthesizer (Perkin 
10 Elmer) using Fmoc/HBTU chemistry. NMR samples contained the SNT-1 PTB 

domain/hFGFRl peptide complex (1 : 1) of -0.5 mM in 100 mM phosphate buffer of 
pH 6.5, 5 mM DTT-d 10 and 0.5 mM EDTA in H 2 0/ 2 H 2 0 (9/1) or 2 H 2 0. 

NMR Spectroscopy: NMR spectra were acquired at 30°C on a Bruker DRX600 or 

15 DRX500 spectrometer. The backbone and side-chain ! H, 13 C and 15 N resonances of 
the protein were assigned using deuterium-decoupled triple-resonance experiments of 
HNCA, HN(CO)CA, HNCACB, HN(CO)CACB and (H)C(CO)NH-TOCSY 
[Yamazaki, et al.,J. Am. Chem. Soc. 116: 1 1655-1 1666 (1994); Sattler et al, Prog, in 
Nuclear Magnetic Resonance Spec. 4:93-158 (1999)] recorded using uniformly 

20 15 N/ 13 C-labeled and fractionally deuterated protein in complex with a non-isotopically 
labeled hFGFRl peptide. The side-chain assignments were completed using 3D 
HCCH-TOCSY [Clore, and Gronenborn, Meth. Enzymol 239:249-363 (1994)] data 
collected from a uniformly 15 N/ 13 C-labeled protein/non-labeled peptide complex 
NOE-derived distance restraints were obtained from 15 N- or 13 C-edited 3D NOESY 

25 spectra [Clore, and Gronenborn, Meth. Enzymol 239:249-363 (1994)]. jangle 

restraints were determined from V HN Ha coupling constants measured in a 3D HNHA-J 
spectrum [Clore, and Gronenborn, Meth. Enzymol 239:249-363 (1994)]. Slowly 
exchanging amide protons were identified from a series of 2D 15 N-HSQC spectra 
recorded after the H 2 0 buffer was changed to 2 H 2 0 buffer. The peptide resonances 

30 were assigned using 13 C/ 15 N-filtered 2D NOESY and TOCSY spectra [Sattler et al, 
Prog, in Nuclear Magnetic Resonance Spec. 4:93-158 (1999)] collected from a 
15 N/ 13 C-labeled protein/non-labeled peptide complex. The intermolecular NOEs used 
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in defining the structure of the SNT-1 PTB domain/hFGFRl complex were detected 
in 13 C- or 15 N-edited (F,) s 13 C/ 15 N-filtered (F 3 ) 3D NOESY spectra. All NMR spectra 
were processed with NMRPipe/NMRDraw [Delaglio et ah, J. Biomol NMR 6:277- 
293 (1995)] and analyzed using NMRView [Johnson and Blevins J. Biomol NMR 
5 4:603-614 (1994)]. Chemical shift assignments of the SNT PTB domain and the 
hFGFRl peptide have been deposited in the BioMagResBank (BMRB) under 
accession number 4790. 

Structure Calculations: Structures of the SNT-1 PTB domain in complex with the 

10 hFGFRl peptide were calculated with a distance geometry and simulated annealing 
protocol using the X-PLOR program [Briinger, X-PLOR Version 3.1: A system for X- 
Ray crystallography and NMR (Yale University Press, New Haven, CT, 1993)]. NOE 
distance and dihedral angle restraints were treated with a square-well potential of 50 
kcal mol" 1 A" 2 . A total of 2448 manually assigned NOE-derived distance restraints 

15 were obtained from the 15 N- or 13 C-edited NOESY data, including 251 intra-peptide 
and 258 intermolecular distance restraints. Additionally, 255 unambiguous and 52 
ambiguous distance restraints were identified from the NOE data by using ARIA 
[Nilges and O'Donoghue, Prog. NMR Spectroscopy 32:107-139 (1998)]. The final 
structure calculations employed a total of 2755 NOE restraints obtained from the 

20 manual and the ARIA-assisted assignments, 2703 of which were unambiguously 
assigned NOE-derived distance restraints that comprise of 1 072 intra-residue, 466 
sequential, 216 medium-range and 949 long-range NOEs. In addition, 70 
hydrogen-bond distance restraints for 35 hydrogen bonds and 19 <p-angle restraints 
were also used in the structure calculations. For the ensemble of the final 20 

25 structures, no distance or torsional angle restraint was violated by more than 0.4 A or 
5°, respectively. The total, distance violation and dihedral violation energies were 
262.0 ± 6.0 kcal mol" 1 , 74.4 ± 1.7 kcal mol* 1 and 0.82 ± 0.08 kcal mol" 1 , respectively. 
The Lennard- Jones potential, which was not used during any refinement stage, was 
-659.3 ±23.1 kcal mol" 1 for the final structures. Ramachandran plot analysis by 

30 Procheck-NMR [Laskowski et al., J. Biomol. NMR 8:477-486 (1996)] showed that in 
the final structures of the complex, 98.1% of the backbone geometries of the non-Gly 
and non-Pro residues in the complex (protein residues 18-116 and peptide residues 
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412-430), and nearly 100% in the secondary structure (protein residues 19-24, 35-40, 
45-49, 52-57, 63-68, 71-76, 85-90, 94-107 and 111-115; and peptide residues 
426-430) lie within energetically favorable or allowed regions. 

5 Mutagenesis and Yeast Two-Hybrid Binding Assays: The yeast two-hybrid binding 
studies of the SNT-1 PTB domain binding to hFGFRl or tyrosine-phosphorylated 
TRK were performed as described previously [Xu et al. 9 J. Biol. Chem. 273: 17987- 
17990 (1998)]. Briefly, SNT-1 cDNA fragments were cloned into the pACT2 
expression vector (Clontech) for expression as GAL4 DNA activation domain (AD) 

10 fusion proteins followed by C-terminal AU1 -epitope tags. The juxtamembrane region 
of hFGFRl was cloned into the pAS2-l expression vector (Clontech) for expression 
as a GAL4 DNA binding domain (BD) fusion protein. This plasmid served as a 
template for site-directed mutagenesis of hFGFRl using the QuikChange kit 
(Stratagene). DNA sequencing confirmed the mutations. AD and BD plasmids were 

1 5 co-transformed into Saccharomyces cerevisiae strain p J69-4 A and plated onto 

selective media. The synthetic medium lacking the amino acids Leu and Trp (Leu", 
Trp") selected for plasmid uptake. Medium lacking His, Leu and Trp (His", Leu", Trp") 
but containing 3 mM 3-aminotriazole was used to select for interaction of the AD and 
BD fusion proteins. Levels of protein interaction were scored by relative colony 

20 growth on these plates. Expression of the SNT-1 protein was confirmed by 

immunoprecipitation from yeast lysates using an anti-AUl monoclonal antibody 
(BAbCo). Western blotting was performed using an anti-AD antibody (Santa Cruz 
Biotech), goat anti-mouse IgG conjugated with horseradish-peroxidase, and 
developed by chemiluminescence. Similarly, expression of wild type and mutant 

25 hFGFRl was detected by immunoprecipitation with a rabbit polyclonal antibody 
specific for BD (Santa Cruz Biotech) and western blotting with mouse monoclonal 
anti-BD antibody (Santa Cruz Biotech). 

Ligand titration. Ligand titration experiments were performed by recording a series 
30 of 2D 15 N- and 13 C-HSQC spectra on the uniformly 15 N-, and i5 N/ 13 C-labeled SNT-1 
PTB domain (~0.3mM), respectively, in the presence of different amounts of the 
peptide ligands concentration ranging from 0 to ~2.0mM. The protein sample and the 
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stock solutions of the ligands were all prepared in the same aqueous buffer containing 
lOOmM phosphate and 5mM perdeuterated DTT at pH 6.5. 

Summary 

5 In an effort to understand the detailed molecular mechanisms by which SNTs regulate 
FGF and NGF receptor signaling, the three-dimensional structure of the SNT-1 PTB 
domain in complex with a peptide derived from the known SNT-1 binding site on the 
FGF receptor has been determined using NMR spectroscopy. On the basis of the new 
structural information, key amino acid residues are disclosed that are important for the 
10 SNT-1 PTB domain interactions with the TRKA and FGF receptors. 

To examine the SNT-1 PTB domain binding to the peptides derived from tyrosine- 
phosphorylated TRKA and non-phosphorylated FGF receptors, the receptor peptides 
were titrated into the protein solution and the chemical shift perturbations of the 

15 protein backbone amide resonances in the HSQC spectra were monitored. The salient 
inferences emerged from the NMR titration data are that the FGF and TRKA receptor 
peptides bind to the PTB domain in specific but distinct manners. This conclusion is 
supported by the observation that distinct sets of amino acid residues undergo 
chemical shift perturbation upon binding to the different peptides in the NMR titration 

20 (Fig. 3B). In addition, binding to the FGFR peptide results in more residues to 

undergo chemical shift changes than the TRKA binding. Taken together, these results 
clearly indicate that the SNT-1 PTB domain employs two different mechanisms in its 
interactions with the FGF and TRKA receptor peptides. 

25 The human sue- 1 -associated neurotrophic (SNT) factor target, GenBank Accession 
No. 5730057, has the amino acid sequence of SEQ ID NO: 1 . The mouse {Mus 
musculus) fibroblast growth factor receptor-1, GenBank Accession No. AAC52182, 
has the amino acid sequence of SEQ ID NO:2. The protein fragments employed in 
the NMR structural determination (summarized in Tables 1-5) were the PTB Domain 

30 of SNT-1 comprising amino acid residues 11-140 of SEQ ID NO:l, and a small 
fragment of FGFR1 comprising amino acid residues 409-430 of SEQ ID NO:2, 
[HSQMA VHKLA KSIPL RRQVT VS (SEQ ID NO:3)]. The NMR spectra were 
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carried out at a concentration of 0.5 mM SNT-1/FGFR1 in a 1:1 complex in a 0.1 M 
sodium phosphate buffer pH 6.5 containing 5 mM DTT and 1 mM EDTA. The 
results of the NMR analysis are included below in Tables 1-5. The depiction of the 
structure is shown in Figure 2. 

5 

Through the use of the information disclosed herein chemical compounds that can 
inhibit or block the SNT-1 PTB domain interaction with FGF receptor or the NGF 
TRKA receptor have been designed and examined. One such compound is a tyrosine- 
phosphorylated peptide having the amino acid sequence of L VI AGNP Ap YRS , SEQ 

10 ID NO:4 (where pY stands for phosphotyrosine). This peptide can bind to the SNT-1 
PTB domain with high affinity of K d ~ 1/iM. The affinity of the SNT-1 PTB domain 
binding for this peptide is about 10-fold higher than that for the FGFR derived peptide 
having the amino acid sequence of SEQ ID NO:3. Furthermore, this peptide partially 
overlaps with the binding of the FGFR peptide having the amino acid sequence of 

15 SEQ ID NO:3 on the SNT-1 PTB domain. These results suggest that this tyrosine- 
phosphorylated peptide could act as a competitive inhibitor to block the interaction 
between the SNT-1 PTB domain and FGFR, and thus could serve as basis for further 
development of small molecule mimetics that inhibit this interaction in cells. 

20 The present study has also identified several key amino acid residues in both the SNT- 
1 PTB domain and FGFR1 that are important for the SNT-1 PTB domain interactions 
with the FGF receptor or the NGF TRKA receptor. The following amino acid 
residues were identified in the SNT-1 PTB domain of SNT-1 (having the amino acid 
sequence of SEQ ID NO:l): Asp 28, Asp 29, Phe 89, Val 112, Glu 114, and Glu 119 

25 

In the SNT-1 PTB binding domain of FGFR 1 of the FGF receptor having the amino 
acid sequence of SEQ ID NO:2 the following amino acid residues were identified: 
Val 414, Leu 417, He 421, Leu 423, Arg 425, Val 427, and Val 429. 

30 Results 

Structure of the SNTPTB Domain: Nuclear magnetic resonance (NMR) studies were 
conducted using a 1:1 complex of SNT-1 PTB domain (residues 1 1-140 of SEQ ID 
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NO:l) and a 22-residue peptide derived from the juxtamembrane region of hFGFRl 
(residues 409-430 of SEQ ID NO:2) (Figures 1A and IB). The dissociation constant 
(K D ) of the protein/peptide complex was estimated to be -10 |iM using the isothermal 
titration calorimetry (ITC) technique. This result is consistent with the interaction 
5 being in slow to intermediate exchange on the NMR timescale in NMR titration 

experiments. The well-defined structure of this complex was determined from a total 
of 2844 NMR-derived restraints (Figure 2 A). The protein structure consists of a 
p-sandwich containing two nearly orthogonal, antiparallel p-sheets capped at one end 
by an amphipathic a-helix (Figures 2B and 2C), as anticipated from a classical PTB 

10 domain. The SNT-1 structure, however, possesses several unforeseen features that are 
unique for the conserved PTB domain scaffold. First, unlike all other known PTB 
domain structures that end with a C-terminal a-helix [Forman-Kay and Pawson, Curr. 
Opin. Struct. Biol 9:690-695 (1999)], the SNT-1 PTB domain has an additional 
P-strand (p8) extending from its C-terminal a-helix (al) that molds the hFGFRl 

1 5 peptide into the second antiparallel p-sheet (see below). Second, boundaries of 

secondary structure elements between SNT and IRS PTB domains do not necessarily 
coincide with their amino acid conservation, such as the conserved VEE motif of p8 
(residues 113-115), suggesting that tertiary interactions are very important in defining 
structure elements. Third, a C-terminal portion of the SNT-1 construct (residues 

20 1 16-136) (Figure 1 A), which is highly homologous (-45% identity) to al in the IRS-1 
PTB domain, is largely structurally disordered. The loss of helical conformation is 
perhaps due to the presence of Pro residues and change of amphipathic nature of the 
sequence, which could disrupt helical propensity and alter interactions with other 
parts of the protein, respectively. While reasons for the conformational discrepancy 

25 between these homologous sequences are evident from structural analysis, the 

functional implications of their evolutionary relationship are not clear. Finally, the 
sequence comprising residues 94-107 in SNT-1, predicted to be a large insert from 
sequence homology alignment with IRS proteins, actually forms an a-helix (al) that 
blocks one side of the P-sandwich. Together, these unique structural features of the 

30 SNT-1 PTB domain may confer its distinct function. 



Interactions between hFGFRl Peptide and the SNT PTB Domain: The hFGFRl 
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peptide wraps around the protein molecule with an unusual backbone conformation 
containing two nearly 90° turns that are oriented orthogonal to each other (Figures 2B 
and 2C). The peptide interacts extensively with the protein by clasping both sides of 
the p-sandwich (Figure 2D). The estimated surface area of SNT-1 buried by the 
5 bound peptide is -2025 A 2 , with 18 of the 22 peptide residues displaying 

intermolecular NOEs to many protein residues (Figure IB). The C-terminal QVTVS 
segment of the peptide (residues 426-430) adopts an antiparallel p-strand (P') 
sandwiched between p5 and P8. Two intermolecular hydrogen bonds bridging p' and 
P5, and a large number of NOEs characteristic of the antiparallel p-sheet are observed 

10 between backbone atoms of the complex (Figure 4A). Side-chains of Val-427 and 
Val-429 interact extensively with Leu-62, Tyr-65, Phe-98, Met- 105, Ile-1 10 and 
Val-1 12 in a hydrophobic core formed between P5 and al (Figure 4B). The peptide 
fastens onto the other side of the P-sandwich by embedding its N-terminal MAVH 
segment (residues 412-415) into a large hydrophobic cavity bounded by the three 

15 loops connecting pl/p2, p3/p4 and p6/p7. In particular, methyl groups of Val-414 are 
completely immersed in an aromatic pool of Trp-57, Phe-74, Phe-87 and Phe-89, and 
also contact Leu-47, Val-55 and Thr-82 (Figure 4C). Moreover, Leu-417, Ile-421 and 
Leu-423 located in the center of the peptide bind to otherwise solvent-exposed 
hydrophobic residues Leu-71, Ile-86 and Ala-88 on the surface of the second P-sheet 

20 (Figure 4D). In addition to hydrophobic interactions, complementary electrostatic 
interactions are observed, largely localized at the two turns in the peptide. At one 
turn, Arg-424 pairs with Asp-68, while Arg-425 interacts with Glu-114 and Glu-1 19. 
At the other turn, Lys-416 and Lys-419 show interactions with a contiguous patch of 
three solvent-accessible residues Asp-27, Asp-28 and Asp-29 (Figure 4E). These 

25 results indicate that both hydrophobic and electrostatic interactions are important for 
SNT-1 and hFGFRl recognition. 

To determine the relative contributions of specific hFGFRl residues to SNT-1 
recognition, site-directed mutagenesis was used to alter the peptide residues that show 
30 intermolecular NOEs to the protein (Figure IB). The resulting hFGFRl mutants were 
analyzed for SNT-1 interaction by yeast two-hybrid binding assays. Substitution of 
Ala for either Leu-423 or Val-429 completely eliminated peptide binding to the 
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SNT-1 PTB domain, while mutation of Val-414, Leu-417, Arg-425 or Val-427 to Ala 
significantly reduced binding (Figure 6A). The reduced protein-protein interactions 
of the individual FGFR1 mutants were not likely due to variations of protein 
expression in the yeast cells (Figure 6A). Particularly, expression levels of the 
5 Leu-423-Ala and Val-429-Ala mutants were at least as good as that of the wild type. 
These mutagenesis data agree with (1) the observed intermolecular NOEs (Figure 
IB); (2) the calculated solvent accessible surface area (particularly for the peptide 
hydrophobic residues); and (3) amino acid conservation of the juxtamembrane region 
of the FGFR family (Figure IB). In addition, these data are also consistent with 

10 results of mutational analysis of FGFR1 and SNT-1 interactions [Ong et al, 

MolCellBiol 20:979-989 (2000)]. Collectively, these results demonstrate that the 
nature of the protein and peptide binding is highly specific and extensive, with 
multiple types of interactions stabilizing the complex. It is interesting to note that 
utilization of the hydrophobic side of the P-sandwich opposite to al for protein 

15 interactions in SNT-1 is atypical and not seen in other PTB domains [Forman-Kay 
and Pawson, Curr. Opin. Struct. Biol. 9:690-695 (1999)]. Amino acid residues in the 
corresponding loops, however, have been shown to interact with phospholipids in the 
PTB domain of She and in the structurally homologous PH domains of several 
signaling molecules [Lemmon et al, Cell 85:621-624 (1996)]. Remarkably, SNT-1 is 

20 also capable of interacting with tyrosine-phosphorylated TRKs, which possess no 
sequence homology to hFGFRl (see below), illustrating the unique functional 
diversity of this conserved PTB domain fold. 

Structural Insights into SNT Binding to Tyrosine-Phosphorylated TRKs: The new 
25 SNT-1 PTB domain structure yields insights into how the protein might interact with 
tyrosine-phosphorylated TRKs. Conserved Arg-63 and Arg-78 residues in SNT-1 are 
structurally analogous to Arg-212 and Arg-227 of IRS- 1 PTB domain, respectively 
(Figures 6B and 6C). The latter pair of Arg residues are essential for IRS-1 binding to 
phosphotyrosine in the canonical NPXpY motif [Zhou et al, Nat. Struc, Biol. 3:388- 
30 393 (1996)]. The mutagenesis and yeast two-hybrid binding studies disclosed herein 
show that the mutation of Arg-63 and Arg-78 to Gin results in complete elimination 
of SNT-1 interaction with TRKB, but has no effect on binding to hFGFRl. 
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Additionally, Ala substitution of He (pY-5) in the TRKA peptide (HIIENPQpYFSDA, 
which is highly homologous to TRKB) caused marked reduction in binding to the 
PTB domain. Based on these results, the mechanism by which SNT-1 binds to TRKs 
is appears similar to that of IRS-1 PTB domain interactions with NPXpY-containing 
5 proteins (Figure 6C). Specifically, the phosphotyrosine of the NPXpY motif in TRKs 
would coordinate with Arg-63 and Arg-78 in SNT-1 PTB domain, and residues 
N-terminal to the phosphotyrosine would adopt an extended conformation with 
hydrophobic side-chains intercalating into the hydrophobic pocket between (35 and al, 
which is also a site for interactions with hFGFRl . Furthermore, NMR titration 
10 experiments showed that hFGFRl and TRK peptides compete for binding to the PTB 
domain, which agrees with the results of the peptide competition experiments in a 
SNT-1 GST pull-down assay [Ong et aL, MolCell.BioL 20:979-989 (2000)]. 
Together, these results demonstrate that the binding of SNT-1 to either 
non-phosphorylated FGFRs or tyrosine-phosphorylated TRKs is mutually exclusive. 

15 

Regulation of SNT and FGFR Association by a Possible Local Conformational 
Change: The (38 strand is structurally unique for the PTB domain fold. It was 
therefore determined whether it was functionally important for SNT-1 binding to 
hFGFRl or phosphorylated TRKs. Truncation studies of SNT-1 were therefore 

20 performed to address this question. Yeast two-hybrid assays showed that a truncated 
SNT-1 PTB domain lacking the p8 region (residues 2-111) almost abolished its ability 
to interact with hFGFRl , without decreasing its TRKB binding (Figure 6C). Another 
SNT-1 truncation mutant (residues 11-114), which ends with (38 shortened by one 
residue, showed markedly reduced binding to hFGFRl peptide in NMR binding 

25 studies (supported by significant line-broadening of the protein NMR signals), but did 
not impair its interactions with the tyrosine-phosphorylated TRKB peptide. The 
effects of the (38 truncation were further confirmed in ITC measurements of the SNT 
PTB domain binding to hFGFRl or TRK peptides. These results indicate that both 
the presence and structural integrity of the (38 region are necessary for SNT-1 binding 

30 to FGFRs but not for its interaction with TRKs. While the overall structural fold of 
the SNT-1 PTB domain may be similar in its free, TRK- or hFGFRl -bound forms, the 
structural requirement of (38 is unique for its hFGFRl association. This observation is 
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consistent with the fact that the anti-parallel (3' strand of the hFGFRl peptide (7 
residues, Figure 6A) is much longer than these NPXpY or related sequences that are 
recognized by the PTB domains of She [Zhou et al , J. Biol. Chem. 270:31119-31123 
(1995)], IRS-1 [Zhou et al, Nature Struct Biol 3:388-393 (1996), Eck et al, Cell 
5 85:695-705 (1996)], or Numb [Zwahlen et al., EMBO .719:1505-1515 (2000)]. 
These findings imply that conformational perturbation of p8 would compromise 
SNT-1 binding to FGFRs, indicating that this p8 strand could act as an on/off switch 
for SNT-1 /FGFR association. 

1 0 Possible Role ofSNTs as Molecular Switches in FGFR and TRK Signaling: The 
present structural and mutational analyses of the SNT-1 PTB domain suggest 
mechanisms by which SNTs coordinate FGF and neurotrophin signaling during 
neuronal differentiation. The ability of SNTs to interact with non-phosphorylated 
FGFR raises the possibility that SNTs are sequestered by FGFRs in unstimulated cells 

1 5 and are only available for activation by FGFs. Events that trigger release of SNTs 
from constitutive FGFR association would allow for SNT interaction with TRKs, 
leading to a neurotrophin-responsive state in differentiating neurons. Such events 
may include FGF receptor down-regulation or conformational perturbation to the p8 
region of the SNT PTB domain by post-translational modifications or interactions 

20 with other protein(s). Alternatively, even in the absence of SNT/FGFR complexes, 
the conformational flexibility of the SNT PTB domain may be constrained during 
neurogenesis to regulate its availability for interaction with and activation by 
neurotrophin receptors. These models may provide new clues to explain how 
differentiating neuronal precursors undergo the well-documented switch from FGF 

25 dependence to neurotrophin dependence [Birren and Anderson Neuron 4:189-201 
(1990); Ip et al, Neuron 13:443-455 (1994); and Stemple et al, Neuron 1:517-525 
(1988)], which is not simply due to changes in TRK expression [Ip et al, Neuron 
13:443-455 (1994)]. 

30 Conclusion 

The new three-dimensional structure of the SNT-1 PTB domain/hFGFRl complex 
reveals the unique features that enable SNT-1 to recognize two radically different 
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receptor sequences in a mutually exclusive manner. The results demonstrate that both 
adaptive hydrophobic interactions as well as complementary electrostatic interactions 
are important factors that underlie specificity and versatility of molecular recognition 
by the conserved PTB domain structural fold. These findings further indicate that 
5 cellular events, which cause a local conformational change in the PTB domain, may 
govern SNT interactions with either FGFRs or TRKs. Thus, the intrinsic adaptability 
and flexibility of the SNT PTB domain appears to serve as a focal point for the 
essential interplay between FGF and neurotrophin receptor signaling that governs 
neuronal survival and differentiation. 

10 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and the accompanying figures. Such modifications are intended to fall 
1 5 within the scope of the appended claims. 

It is further to be understood that all base sizes or amino acid sizes, and all molecular 
weight or molecular mass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. 

20 

Various publications are cited herein, the disclosures of which are hereby incorporated 
by reference herein in their entireties. 



